(12) INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(19) World Intellectual Property 
Organization 
International Bureau 




i mil iiaim 11 nim iiiiniiiEniii im m m iimiiiiiuiii inn urn mi umi mi ini nil 



(43) International Publication Date (10) International Publication Number 

8 January 2004 (08.01.2004) PCT WO 2004/003013 Al 



(51) International Patent Classification 7 : C07K 14/415 



(21) International Application Number: 



PCT/EP2003/006757 



(22) International Filing Date: 26 June 2003 (26.06.2003) 



(25) Filing Language: 

(26) Publication Language: 

(30) Priority Data: 

0214896.3 



English 
English 

27 June 2002 (27.06.2002) GB 



(71) Applicant (for all designated States except US): NOVAR- 
TIS FORSCHUNGSSTIFTUNG, ZWEIGNEEDER- 
LASSUNG FRIEDRICH MIESCHER INSTITUTE 
FOR BIOMEDICAL RESEARCH [CH/CH]; Maulbeer- 
strasse 66, CH-4058 Basel (CH). 

(72) Inventors; and 

(75) Inventors/Applicants (for US only): FRITSCH, Olivier 
[FR/FR]; 9, Rue des Artisans, F-68000 Colmar (FR). 
HOHN, Barbara [AT/CH]; Hangstrasse 35, CH-4144 
Arlesheim (CH). LUCHT, Jan, Martin [DE/CH]; Unterer 
Batterieweg 113, CH-4059 Basel (CH). 



(74) Common Representative: NOVARTTS 
FORSCHUNGSSTIFTUNG, ZWEIGNIEDERLAS- 
SUNG FRIEDRICH MIESCHER INSTITUTE FOR 
BIOMEDICAL RESEARCH; Maulbeerstrasse 66, 
CH-4058 Basel (CH). 

(81) Designated States (national): AE, AG, AL, AM, AT, AU, 
AZ, BA, BB, BG, BR, BY, BZ, CA, CH, CN, CO, CR, CU, 
CZ, DE, DK, DM, DZ, EC, EE, ES, H, GB, GD, GE, GH, 
GM, HR, HU, ID, IL, IN, IS, JP, KE, KG, KP, KR, KZ, LC, 
LK, LR, LS, LT. LU, LV, MA, MD, MG, MK, MN, MW, 
MX, MZ, NO, NZ, OM, PH, PL, PT, RO, RU, SD, SE, SG, 
SK, SL, TJ, TM, TN, TR, TT, TZ, UA, UG, US, UZ, VN, 
YU, ZA, ZM, ZW. 

(84) Designated States (regional): ARIPO patent (GH, GM, 
KE, LS, MW, MZ, SD, SL, SZ, TZ, UG, ZM, ZW), 
Eurasian patent (AM, AZ, BY, KG, KZ, MD, RU, TJ, TM), 
European patent (AT, BE, BG, CH, CY, CZ, DE, DK, EE, 
ES, FI, FR, GB, GR, HU, IE, IT, LU, MC, NL, PT, RO, 
SE, SI, SK, TR), OAPI patent (BF, BJ, CF, CG, CI, CM, 
GA, GN, GQ, GW, ML, MR, NE, SN, TD, TG). 

Published: 

— with international search report 

For two-letter codes and other abbreviations, refer to the "Guid- 
ance Notes on Codes and Abbreviations" appearing at the begin- 
ning of each regular issue of the PCT Gazette. 



< 

m 



(54) Title: GENE FOR INCREASED SOMATIC RECOMBINATION 



O (57) Abstract: The present invention relates to nucleic acids encoding polypeptides involved in homologous recombination, as well 
S as vectors and host cells comprising the nucleic acids and polypeptides encoded by the nucleic acids. Also provided are methods 
for inducing somatic and/or meiotic homologous recombination in a cell, comprising modulating the expression or properties of one 
Q or more gene products selected from the group consisting of Atlno80, At3g57300, Rvbl (At5g22330), Rvb21 (At5g67630), Rvb22 
^ (At3g49830), At3g57290, AtArp5.1 (At3gl2380), AtArp5.2 (At5g56l80), AtArp5.3 (At3g60830) and AtArp8 (At5g43500), their 
^ homologues, fragments or derivatives. In particular, the methods can be used to increase gene targeting. 



Pramod B. Mahajan 
Serial No. 10/782,435 



Appendix E 



WO 2004/003013 



PCT/EP2003/006757 



Gene for increased somatic recombination 
TECHNICAL FIELD 

The present invention relates to DNA that encodes proteins that control somatic 
recombination, in particular in plants. 

BACKGROUND 

Cells of all organisms have evolved a series of DNA repair pathways that counteract the 
deleterious effects of DNA damage and are triggered by intricate signal cascades. 
Homologous recombination in plants stabilizes the genome by repairing damaged 
chromosomes simultaneously generating genetic variability through the creation of new 
genes and new genetic linkages. Repair of DNA damage by recombination is particularly 
significant for cells under exogenous and endogenous genotoxic stress because of its 
potential to remove a wide range of DNA lesions. The current understanding of genetic and 
molecular components underlying meiotic and somatic recombination and DNA repair in 
plants is limited. To be able to modify or improve DNA repair using gene technology it is 
necessary to identify key proteins involved in said pathways or cascades. 

The precise manipulation of the genome of higher plants is still a major challenge for plant 
genetic engineering. Some advances have been made recently for the creation of point 
mutations at predetermined positions by chimeric RNA/DNA oligonucleotides (Beetham et al. 
1999, Hohn & Puchta 1999, Zhu et ai. 1999, Kipp et al. 2000, Zhu et al. 2000). However, the 
targeted insertion of longer stretches of DNA sequence at any desired location ("knock-in") 
or the replacement of predetermined plant genomic sequences by heterologous DNA 
("knock-out) via homologous recombination is at present not possible as a routine technique 
(Mengfste & Paszkowski 1999, Puchta 2002). 

Few reports have appeared in the literature that describe successful "gene targeting" in 
higher plants (Paszkowski et al. 1988, Lee et al. 1990, Offringa et al. 1990, Miao & Lam 
1995, Kempin et al. 1997, Hanin et al. 2001), but the reported absolute numbers and relative 
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frequencies of the desired events were very low. Indeed, the main problem for "gene 
targeting" experiments is the low frequency of the desired homologous recombination events 
- 10" 3 to 10* 5 (Hohn & Puchta 1999, Mengiste & Paszkowski 1999) - relative to illegitimate 
recombination/integration events. 

Various attempts of increasing the low relative frequency of targeted homologous 
recombination events, by improved selection schemes ("positive-negative selection") or by 
providing extended regions of sequence homology, were not successfui (Thykjaer et a). 
1997, Gallego et al. 1999). One promising strategy to facilitate gene targeting in higher 
plants would be to shift the balance between illegitimate and homologous recombination 
events towards the latter, by facilitating homologous recombination events in plants by 
genetic manipulation (Gherbi et al. 2001). 

One approach described in the literature is the expression in plants of heterologous proteins 
known to be involved in homologous recombination. Overproduction of the bacterial 
resolvase RuvC was shown to increase somatic inter-and intra-chromosomal recombination, 
as well as extrachromosomai recombination (Shalev et al. 1999), but no gene targeting 
studies were reported yet with this system. Expression of the bacterial RecA protein had 
similar effects (Reiss et al. 1996, Reiss et al. 1997), but subsequent experiments did not 
show an increase of gene targeting events (Reiss et al. 2000). So far, it is not clear whether 
heterologous proteins can successfully interact with the plant recombination machinery to 
affect the outcome of the recombination events required for gene targeting. In addition, 
these foreign proteins might have undesired side effects in plants. 

An alternative approach is to rely on endogenous plant genes to influence the frequency of 
homologous recombination events. So far, indirect approaches have been reported to isolate 
plant genes involved in recombination. The cloning of plant orthologs to recombination and 
repair genes from other species was reported (Klimyuk & Jones 1997, Doutriaux et al. 1998, 
Hartung & Puchta 1999, Gallego et al. 2000, Lin et al. 2000), but so far the importance of 
these genes for recombination in plants has only been evaluated for the RAD50 homologue 
(Gherbi et a., 2001). Functional screens have been carried out to identify plant mutants 
hypersensitive to genotoxic treatments (Davies et al. 1994, Jenkins et al. 1995, Jiang et al. 
1997, Masson et al. 1997, Albinsky et al. 1999, Mengiste et al. 1999). Since recombination is 
an important mechanism for DNA repair, some of these mutants might be affected in their 
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recombination behavior. This was experimentally demonstrated for some X-ray 
hypersensitive Arabidopsis mutants that also showed reduced levels of somatic 
recombination (Masson & Paszkowski 1997), although the affected gene has not been 
isolated. Recently, a DNA damage hypersensitive Arabidopsis mutant was isolated from a T- 
DNA tagged population, the affected gene (MIM) was cloned and shown to encode an SMC 
(Structural Maintenance of Chromatin) protein. Since the mim mutant showed decreased 
frequencies of somatic recombination, MIM seems be involved in some aspect of somatic 
recombination (Mengiste et a!. 1999). Also in tobacco a hyperrecombinogenic mutant was 
isolated (Gorbunova et al. 2000). However, the gene affected could not be isolated so far. 

Previously, a genetic system was described to study somatic homologous recombination 
between repeated sequences in whole plants (Swoboda et al. 1994, Puchta et al. 1995a, 
Puchta et al. 1995b). Briefly, a transgene carrying two non-functional halves of the p- 
glucuronidase reporter gene sharing a stretch of sequence identity serves as a reporter 
construct. Homologous recombination between the repeated sequences results in the 
restoration of a functional reporter gene. Such events were detected by a sensitive 
histochemica! assay, and confirmed by Southern blotting. This assay is destructive, since the 
staining procedure is lethal, so that direct isolation of mutants is difficult. 

There is a need in the art to identify genes that increase somatic recombination and this 
invention meets that need. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 depicts an alignment of an Atlno80 sequence (SEQ ID NO:1) and public sequence, 
At3g57300 (SEQ ID NO:3), showing a splicing difference ("Query" refers to AtlnoSO 
sequence;J£bjct" to public database sequence, gi|18410689|ref|NMJ 15590.1| 
(AGI:At3g57300) 

SUMMARY OF THE INVENTION 

The present invention provides an isolated nucleic acid, in particular DNA, comprising a 
sequence having 98.5% or more identity with the sequence depicted in SEQ ID NO: 1 
(Atlno80). Also provided are vectors and host cells comprising the nucleic acids of the 
invention, as well as polypeptides encoded by the nucleic acids. 

In a further aspect of the invention, a method for inducing homologous recombination in a 
cell is provided, comprising modulating the expression or properties of one or more gene 
products selected from the group consisting of Atlno80 (SEQ ID NOs:1 and 2), At3g57300 
(SEQ ID NO:3), Rvbl (At5g22330; SEQ ID Nos; 4 and 5), Rvb21 (At5g67630; SEQ ID NOs; 
6 and 7), Rvb22 (At3g49830: SEQ ID NOs: 8 and 9), At3g57290 (SEQ ID NO: 10), AtArp5.1 
(At3g12380; SEQ ID NOs: 11 and 12), AtArp5.2 (At5g56180; SEQ ID NOs: 13 and 14), 
AtArp5.3 (At3g60830; SEQ ID Ns: 15 and 16 ) and AtArp8 (At5g43500; SEQ ID Nos: 17 and 
18), their homologues, fragments or derivatives. In one embodiment, modulation is achieved 
by increasing expression of the gene product, such as by introducing a nucleic acid encoding 
the gene product into the cell operably linked to a promoter; and allowing transcription and 
translation of the gene in an amount sufficient to affect homologous recombination in said 
cell. 

The method can be used to increase somatic homologous recombination and/or meiotic 
homologous recombination. The promoter can be an inducible promoter, a tissue-specific 
promoter, a constitutive promoter or a meiosis-specific promoter, depending on the desired 
effect. 
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Also provided is a method of increasing gene targeting to a desired locus in a host cell 
comprising introducing a desired gene into a host cell, modulating the expression or 
properties of one or more gene products selected from the group consisting of AtlnoSO, 
At3g57300, Rvb1, Rvb21, Rvb22, At3g57290, AtArp5.1, AtArp5.2, AtArp5.3 and AtArp8, or 
functional fragments, derivatives and homologues thereof in the host cell, and detecting 
integration of the desired gene at a selected locus in the genome of the host cell. 

DETAILED DESCRIPTION OF THE INVENTION 

The present inventors have used a direct screening approach to identify mutants of 
Arabidopsis thaliana showing increased frequencies of somatic recombination, by visualizing 
recombination events in living plants from a mutagenized population and directly isolating 
plants with the desired phenotype. The description below describes a genetic screen and an 
Arabidopsis mutant sm22 derived from it, and the associated plant genes responsible for the 
altered recombination phenotype. 

Existing technologies for gene targeting in plants are very inefficient The modulation of the 
expression or properties of one or more gene products selected from the group consisting of 
Atlno80 (SEQ ID NOs:1 and 2), At3g57300 (SEQ ID NO:3), Rvb1 (At5g22330; SEQ ID Nos: 
4 and 5), Rvb2(1 and 2; also referred to herein as Rvb21 , At5g67630; SEQ ID NOs: 6 and 7 
and Rvb22, At3g49830: SEQ ID NOs: 8 and 9, respectively), At3g57290 (SEQ ID NO: 10), 
AtArp5.1 (At3g12380; SEQ ID NOs: 11 and 12), AtArp5.2 (At5g56180; SEQ ID NOs: 13 and 
14), AtArp5.3 (At3g60830; SEQ ID Ns: 15 and 16 ) and AtArp8 (At5g43500; SEQ ID Nos: 17 
and 18), their homologues (including orthologs), fragments or derivatives increases the 
efficiency of gene targeting events and facilitates the routine manipulation of the genome of 
higher plants by homologous recombination. For the purposes of this disclosure, to avoid 
repetition, reference to the above group of gene products is meant to include reference to 
each gene individually, i.e., the modulation of the expression or properties of Atlno80, the 
modulation of the expression or properties of At3g57300, and so on. 

An in vivo screen for Arabidopsis mutants has been devised to allow direct detection of 
mutants with altered recombination. As a result of the screen, mutant plants with a more 
than 10-fold increased or altered frequency of somatic recombination events are provided, 
as well as the plant gene Atlno80. One or more of AtlnoSO, At3g57300, Rvb1, Rvb21, 
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Rvb22, At3g57290, AtArp5.1, AtArp5.2, AtArp5.3 and AtArp8 or orthologs from other plant 
species are affected in these mutant plants. The screen allows the identification of mutant 
plants, and plant genes with a strong effect on recombination having little or no undesired 
side effects on the plant. An increase in homologous recombination frequency is useful to 
achieve an increased efficiency of gene targeting in plants. 

Within the context of the present invention reference to a gene is to be understood as 
reference to a DNA coding sequence associated with regulatory sequences, which allow 
transcription of the coding sequence into RNA such as mRNA, rRNA, tRNA, snRNA, sense 
RNA or antisense RNA. Examples of regulatory sequences are promoter sequences, 5' and 
3' untranslated sequences, introns, and termination sequences. 

A promoter is understood to be a DNA sequence initiating transcription of an associated 
DNA sequence, and may also include elements that act as regulators of gene expression 
such as activators, enhancers, or repressors. 

Expression of a gene refers to its transcription into RNA or its transcription and subsequent 
translation into protein within a living cell. In the case of antisense constructs expression 
refers to the transcription of the antisense DNA only. 

The term transformation of ceils designates the introduction of nucleic acid into a host cell, 
particularly the stable integration of a DNA molecule into the genome of said cell. 
Any part or piece of a specific nucleotide or amino acid sequence is referred to as a 
component sequence or fragment . 

In one aspect of the invention, nucleic acids and polypeptides are provided that can 
modulate homologous recombination. A nucleic acid according to the present invention 
comprises a sequence having 98.5%, 99%, 99.5% or more identity with the sequences 
depicted in SEQ ID NO:1. The nucleic acid can be DNA or RNA, such as, mRNA, rRNA, 
tRNA, snRNA, sense RNA or antisense RNA. Also provided is a vector comprising the 
nucleic acid of the invention, as well as host cells comprising the vector or nucleic acid of the 
invention. Suitable vectors and host cells are described in more detail below. Also provided 
are polypeptides encoded by the nuc/eic acids of the invention. 
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In a further aspect of the invention, methods for increasing homologous recombination are 
provided by modulating the expression or properties of one or more gene products selected 
from the group consisting of At!no80, At3g57300, Rvb1, Rvb21, Rvb22, At3g57290, 
AtArp5.1, AtArp5.2, AtArp5.3 and AtArp8. In order to increase homologous recombination 
several methods are useful depending on the gene and the gene targeting technique 
employed. Typically, modulation will mean increasing the activity of the gene product, which 
can easily be achieved by methods known in the art. 

In one embodiment, the desired gene is overexpressed in a host cell in an amount sufficient 
to increase homologous recombination in the host cell. By "overexpression", it is meant 
increasing the amount of desired gene product in a host cell, compared to untreated cells. A 
simple way to achieve overexpression is to produce transgenic host cells, in particular 
transgenic plants, carrying a construct (vector) that ectopically overexpresses the sequence 
of interest under the control of a suitable promoter, such as the 35S CaMV, MAS 
(mannopine synthase) or ubiquitin promoter. 

In another embodiment, an inducible promoter is used to allow an increase in homologous 
recombination frequency at the time and place needed, for example, for gene targeting. 

Alternatively, the construct increasing recombination can be provided at the same time as 
the targeting construct by co-transformation, the effect is then achieved by the transient 
expression of the construct containing the said genes. 

Functional fragments, homologues (including orthologs) or derivatives can be easily 
identified by alignment with the sequences referred to above. In general two approaches 
exist to sequence alignment. Algorithms as proposed by Needleman & Wunsch and by 
Sellers align the entire length of two sequences providing a global alignment of the 
sequences. The Smith-Waterman algorithm on the other hand yields local alignments. A 
local alignment aligns the pair of regions within the sequences that are most similar given 
the choice of scoring matrix and gap penalties. This allows a database search to focus on 
the most highly conserved regions of the sequences. It also allows similar domains within 
sequences to be identified. To speed up alignments using the Smith-Waterman algorithm 
both BLAST (Basic Local Alignment Search Tool) and FASTA place additional restrictions on 
the alignments. 
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Within the context of the present invention alignments are conveniently performed using 
BLAST, a set of similarity search programs designed to explore all of the available sequence 
databases regardless of whether the query is protein or DNA. Version BLAST 2.0 (Gapped 
BLAST) of this search tool has been made publicly available on the internet (currently 
httpy/vvww.ncbi.nfm.nih.gov/BLAST/). It uses a heuristic algorithm which seeks local as 
opposed to global alignments and is therefore able to detect relationships among sequences 
which share only isolated regions. The scores assigned in a BLAST search have a well- 
defined statistical interpretation. Particularly useful within the scope of the present invention 
are the blastp program allowing for the introduction of gaps in the local sequence alignments 
and the PSI-BLAST program, both programs comparing an amino acid query sequence 
against a protein sequence database, as well as a blastp variant program allowing local 
alignment of two sequences only. Said programs are preferably run with optional parameters 
set to the default values. 

Sequence alignments using BLAST can also take into account whether the substitution of 
one amino acid for another is likely to conserve the physical and chemical properties 
necessary to maintain the structure and function of the protein or is more likely to disrupt 
essential structural and functional features of a protein. Such sequence similarity is 
quantified in terms of a percentage of "positive" amino acids, as compared to the percentage 
of identical amino acids and can help assigning a protein to the correct protein family in 
border-line cases. 

Specific examples of DNA and encoded proteins according to the present Invention are 
described in SEQ ID NOS: 1-18. The putative ATPase/helicase Atlno80 may, as 
demonstrated in yeast, be part of a complex containing one or more of Rvb1, Rvb2, Arp5 
and Arp8. All these proteins may be useful in increasing homologous recombination 
frequency. 

Typically, functional fragments or derivatives are characterized by an amino acid sequence 
comprising a component sequence of at least 150 amino acid residues having 40% or more 
identity with an aligned component sequence of the one or more of the polypeptides 
encoded by the DNA of SEQ ID NOs: 1, 3, 4, 6, 8, 10, 11, 13, 15, 16 or 18. Preferably the 
amino acid sequence identity is higher than 50% or even higher than 55%. Most preferably 
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the protein sequence is that of SEQ ID NO:2. 

DNA encoding proteins according to the present invention can be isolated from 
monocotyledonous and dicotyledonous plants. Preferred sources are corn, sugarbeet, 
sunflower, winter oilseed rape, soybean, cotton, wheat, rice, potato, broccoli, cauliflower, 
cabbage, cucumber, sweet com, daikon, garden beans, lettuce, melon, pepper, squash, 
tomato, or watermelon. However, they can also be isolated from mammalian sources such 
as mouse or human tissues. The following general method, can be used, which the person 
skilled in the art knows to adapt to the specific task. A single stranded fragment of the 
desired gene consisting of at least 15, preferably 20 to 30 or even more than 100 
consecutive nucleotides is used as a probe to screen a DNA library for clones hybridizing to 
said fragment The factors to be observed for hybridization are described in Sambrook et al, 
Molecular cloning: A laboratory manual, Cold Spring Harbor Laboratory Press, chapters 
9.47-9.57 and 11.45-11.49, 1989. Hybridizing clones are sequenced and DNA of clones 
comprising a complete coding region encoding a protein characterized by an amino acid 
sequence comprising a component sequence of at least 150 amino acid residues having 
40% or more sequence identity to the protein sequence encoded by the desired gene is 
purified. Said DNA can then be further processed by a number of routine recombinant DNA 
techniques such as restriction enzyme digestion, ligation, or polymerase chain reaction 
analysis. The disclosure of the nucleotide sequences in SEQ ID NOs: 1, 3, 4, 6, 8, 10, 11, 
13, 15, 16 and 18 enables a person skilled in the art to design oligonucleotides for 
polymerase chain reactions which attempt to amplify DNA fragments from templates 
comprising a sequence of nucleotides characterized by any continuous sequence of 15 and 
preferably 20 to 30 or more basepairs of the desired gene. 

Svitabie vectors for practicing the methods of the invention are well known in the art 
Similalry, host cells can be derived from monocotyledonous or dicotyledonous plants. 
Preferred sources are corn, sugarbeet, sunflower, winter oilseed rape, soybean, cotton, 
wheat, rice, potato, broccoli, cauliflower, cabbage, cucumber, sweet corn, daikon, garden 
beans, lettuce, melon, pepper, squash, tomato, or watermelon. However, host cells can also 
be isolated from other sources, including mammalian sources such as mouse or human 
cells, in particular stem cells. It is preferred that mammalian homologues are used in 
mammalian cells. 
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The methods for increasing homologous recombination are useful to obtain gene targeting 
so that a gene of interest is introduce into the genome at a desired locus, instead of 
randomly. For some hosts, in particular crop plants, the gene is preferably expressed in a 
selected tissue where expression is needed. This is easily achieved by the use of tissue 
specific promoter. Thus, the present invention provides a method for increasing somatic 
homologous recombination and increasing gene targeting by modulating the expression or 
properties of one or more gene products selected from the group consisting of AtlnoSO, 
At3g57300, Rvb1, Rvb21, Rvb22, At3g57290, AtArp5.1, AtArp5.2, AtArp5.3 and AtArp8 and 
fragments, derivatives and homologues thereof, essentially as described above. As is 
apparent to one of ordinary skill in the art, the corresponding ortholog is preferably used for 
any particular plant. For example, the com ortholog of Ino80 is used (or modulated) to 
increase recombination in com. 

The methods are also useful to improve meiotic recombination, thereby facilitating breeding 
of species, in which genes encoding a particular phenotype are transferred between plants. 
Crossing in an interesting trait from another variety or species into a given variety by 
conventional breeding is a very time and labour-intensive process. Several generations of 
back-crosses have to be carried out to eliminate the undesired genetic material of the donor 
species, while maintaining the desired phenotype or trait. Using the methods described 
above for increasing homologous recombination, meiotic recombination frequencies can be 
increased, preferably by expressing the desired gene under the control of a meiosis-specific 
promoter or inducible promoter, the breeding process is speeded up. Thus, the present 
invention provides a method for increasing meiotic recombination by modulating the 
expression or properties of one or more gene products selected from the group consisting of 
Atlno80, At3g57300, Rvb1, Rvb21, Rvb22, At3g57290, AtArp5.1, AtArp5.2, AtArp5.3 and 
AtArp8 and fragments, derivatives and homologues thereof, essentially as described above. 

The Examples below are provided for illustrative purposes and are in no way intended to be 
limiting to the invention. 
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EXAMPLES : 

Example 1 : Identification of sm22 gene effective in increasing homologous gene 
recombination. 

We have used for our screening a newly constructed a transgenic Arabidopsis thaliana line 
that carries a recombination reporter construct based on the firefly luciferase gene. The 
structure of the reporter construct - two segments of the luciferase gene arranged as 
inverted repeats - is comparable to that of the previously described beta-glucuronidase 
reporter (Swoboda et al. 1994, Puchta et al. 1995a, Puchta et af. 1995b), but offers the 
advantage that recombination events can be detected in living plants. Luciferase activity in 
cells in which recombination has restored an intact luciferase gene can be detected by light 
emission after application of the substrate D-luciferin using a high-sensitivity CCD camera 
(Millar et al. 1992, Millar et al. 1995a, Millar et al. 1995b, Michelet & Chua 1996). 

To induce hyperrecombination mutations in the luciferase recombination reporter line, we 
used T-DNA activation tagging with a mutagenic construct (pAC102). "Activation tagging" 
refers to the transcriptional activation of endogenous plant genes by random integration of a 
construct that carries promoter or enhancer sequences. One published approach for 
"activation tagging" is the introduction, via Agrobacterium-mediated gene transfer, of a T- 
DNA carrying severaJ copies of the cauliflower mosaic virus (CaMV) 35S enhancer (Fang et 
ai. 1989), which can activate the expression of heterologous genes over a distance (Hayashi 
et al. 1992, Walden et al. 1994, Kakimoto 1996, Kardailsky et al. 1999, Weigel et al. 2000). 
Another published approach is the introduction of a complete, outward-pointing CaMV 35S 
promoter on a transposable Ds element (Wilson et al. 1996, Schaffer et al. 1998, Fridborg et 
al. 1999). The construct w pAC102" used for our experiments is a combination of these 
previously described elements: it is a binary vector carrying a T-DNA that can be transferred 
to plants that contains a complete, outward-pointing copy of the CaMV 35S 
promoter/enhancer close to the right T-DNA border. Thus, this construct combines the ease 
of application of T-DNA gene transfer with the genetic ability of a complete promoter, 
avoiding some of the drawbacks of enhancer-only constructs (Weigel et al. 2000). 

In principle, the activation tagging construct can cause several kinds of mutations after 
integration in the plant genome: gene disruption by insertion within a coding sequence, 
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activation of plant gene expression by action of the CaMV 35S enhancer, direct expression 
of a plant gene from the CaMV 35S promoter on the T-DNA, or down-regulation of 
expression by antisense RNA production driven from the CaMV 35S promoter. The pAC102 
T-DNA carries in addition to the 35S promoter a complete copy of the pUC cloning vector to 
facilitate gene cloning by plasmid rescue (Dilkes & Feldmann 1998), and a sulfonamide 
resistance marker (Guerineau et al. 1990, Reiss et al. 1996) for selection of transgenic 
plants. 

We transformed 13.000 three-week old Arabidopsis ecotype Columbia plants from the 
luc'rferase recombination reporter line with the activation-tagging T-DNA construct ll pAC102" 
by Agrobacterium-mediated gene-transfer, using the established "floral dip" procedure 
(Clough & Bent 1998) with a modified infiltration buffer, in which the Silwet L-77 detergent 
was replaced by 0.05% Extravon® (Ciba). Seeds from the infiltrated plants were harvested 
three weeks after infiltration. Transgenic progeny carrying the pAC102 activation tagging T- 
DNA were selected by sowing seeds on perlite substrate drenched with Gamborg B5 mineral 
medium (Gamborg et al. 1968) containing 10 mg/I sulfadiazine (Sigma), and transferring 
surviving individuals after 10 days to soil. About 20,000 sulfonamide-resistant plants were 
isolated; they represent independent transformants with the pAC102 T-DNA activation 
tagging construct integrated at different random positions in the Arabidopsis genome. 

When individual plants had grown to the 10-leaf stage, they were assayed for lucif erase 
activity to detect somatic recombination events. Batches of 25 plants were sprayed with the 
substrate D-luciferin and pictures (typically two) were taken with a "Astrocam" (Gloor 
Instruments, Uster) by integrating photons over 15 min. Background noise and cosmic 
radiation was filtered out by correlating both images using the minimum function. Plants 
showing an increased number of sectors with lucif erase activity relative to the average of the 
population were observed with a frequency of about 1 in 500 plants. 

As a result of the screen, a hyperrecombination mutant plant was isolated called sm22. The 
original hyper-recombination phenotype of sm22 plant shows an enhancement of about 20- 
to 50- fold for homologous recombination in the reporter line. No other obvious phenotype 
was seen and the seed yield was normal. Sulfonamide selection in the second generation 
(T2) reveafed a 2/1 or 3/1 segregation of resistant seedlings, thus showing that there is only 
1 locus (or 2 closely related loci) with an active T-DNA inserted. However, the T2 
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recombination phenotype was even lower (less or same number of recombination events per 
plant) than in the wild type. 

After Hindlll digestion of T1 callus genomic DNA prepared essentially according to the 
method of the Nucleon Phytopure protocol and Plant DNA extraction kit (Amersham), 
pfasmid rescue was applied (Dilkes & Feldmann 1998, Mathur et at. 1998), which gave rise 
to two independent junction fragments. Briefly, we digested plant genomic DNA with HinDlll, 
circularized the resulting fragments by ligation at low DNA concentration, and transformed 
the ligation mixture into competent E. coli TOP10 cells (commercially available from 
INVITROGEN) by electroporation. Since the Hindlll fragments containing the fusion joint 
between plant DNA and the right end of the activation tagging construct carry a plasmid 
origin and the ampicillin resistance gene (bla) contributed by pAC102, circularization of such 
fragments will result in a functional bacterial plasmid and confer ampicillin resistance to the 
E. coli cells. 

Several colonies were obtained after plating the transformed bacteria on selection medium 
containing ampicillin. Plasmid DNA of these transformants was prepared and characterized 
by restriction analysisTo determine the nature of the plant sequences joined to the right end 
of the T-DNA, the plant DNA insert from these rescued plasmids was sequenced from both 
sides, using one custom sequencing primer complementary to the T-DNA right end reading 
towards the plant DNA, and the standard M13 reverse sequencing primer, reading from the 
pAC102 vector sequences into the plant DNA insert from the other end. The obtained DNA 
sequences were compared to the GenBank nucleotide database using the BLASTN search 
program. 

Two insertions were identified. The first one corresponds to a single T-DNA insertion without 
deletion (left border, LB, junction sequenced) in the N-terminal region of a putative 
ATPase/helicase gene, in antisense orientation. The second T-DNA inserted in a gene with 
no obvious relationship to homologous recombination (gb AF082176_1) and does not confer 
sulfonamide resistance. Six (T3) resistant families were analysed by PCR and Southern. 
Only one family contained some plants with the second insertion whereas all families have 
the helicase insertion site. 
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ln subsequent generations, homozygous plants for the helicase insertion site were obtained. 
The homologous recombination frequency of heterozygous and homozygous plants for this 
insertion site was at least 50% and 15%, respectively, and up to 80% and 20%, respectively, 
of the wild type level. 

The complete cDNA (4.8kb) was cloned in two steps. First, a public EST containing the 3' 
part was sequenced. Then the 5' part of the cDNA was amplified by RT-PCR on Col-0 
(Arabidopsis Columbia ecotype, wild type) callus RNA (prepared with the Qiagen RNAeasy 
Plant Kit), using primers in the 5' untranslated region including a stop codon in frame with 
the predicted ATG (sm5UT) to make sure that the complete 5' part of the cDNA was 
amplified. The primer sequences were sm5UT: ctagaagcttttaaggatTA4gactctcc (SEQ ID 
NO:19) and for 3' primer: ctcgtatgtatcccccttctcc (SEQ ID NO:20). The coding sequence is 
provided as SEQ ID NO.1 and is similar to but not identical to At3g57300 (see Fig.1). 

The predicted helicase gene (8kb genomic DNA) has about 23 exons encoding a protein of 
1507 amino acids (SEQ ID NO:2). It is predicted to be an ATPase of the Swi2/Snf2 family, 
and contains several nuclear localization signals (NLS). The ATPase/heiicase gene is the 
putative Arabidopsis ortholog of the yeast 1no80p/YGL150c protein (Ebbert et al. (1999), 
Shen et al. (2000)). Homologs exist in yeast, budding yeast, Drosophila and human. These 
four homologues have several highly conserved regions including the six motifs of the 
SWI2/SNF2 helicase domain. Several NLS suggest a nuclear localization of the gene 
product. 

in the sm22 heterozygous and homozygous mutant, the level of Ino80 transcript is 
respectively about 50% of the wild type situation or absent, as measured by semi- 
quantitative RT-PCR (5* At3g57300 primer: TGATGGATCTATCACCATCAG, SEQ ID 
NO:21; 3' At3g57300 primer: ggtgggattccaatcactttc, SEQ ID NO:22) and by northern biot 
hybridization. For this, the RNA was extracted from 2 weeks old seedlings using the 
QIAGEN RNAeasy plant extraction kit following the manufacturer's instructions. Together 
with the decrease of homologous recombination in sm22 plants described above (50% in 
heterozygous plants, 15% in homozygous mutant), this result shows that the level of Ino80 
gene product might positively and directly regulate the homologous recombination 
frequency, making this gene a choice candidate to positively regulate homologous 
recombination. 
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Results with a recombination reporter line 1445 (Gherbi et al. 2001) overexpressing the 
INO80 cDNA under the control of the 35S promoter and with an N-terminal HA-tag 
interrupted by an intron show upregulation of homologous recombination providing evidence 
that INO80 upregulates homologous recombination. 

The yeast homolog (Ebbert et a/,,1999), INO80(=YGL150c) is part of a big complex >1MDa 
(monomeric form is 171KDa), containing two essential helicases Rvblp and Rvb2p and actin 
related proteins (arp) Arp4, Arp5 and Arp8 (Cho et al. 2001 ; Jonsson et al. 2001 ; Wood et al. 
2000). Thus, the involvement of Ino80 in homologous recombination implicates the activity of 
these other genes in homologous recombination. In eukaryotes Human Rvb1 p and Rvb2p 
are also known (Kanemaki 1999, Ikura et al. 2000, Shen et al. 2000). 

In Arabidopsis thaliana we found three genes closely related to Rvbs from other organisms, 
AtRvbl (SEQ ID NO:4, SEQ ID NO:5), AtRvb21 (SEQ ID NO: 6, SEQ ID NO:7) and 
AtRvb22 (SEQ ID NO:8, SEQ ID NO:9). The three genes are expressed (RT-PCR) and 
some of them are positively regulated by genotoxic stress (UVc, bleomycin). For treatment 
with Bleomycin (BLM) 2 week-old Arabidopsis seedlings were placed under sterile conditions 
in liquid GM medium containing lO^M of BLM (Sigma) or 100 ppm of MMS (Fluka, 
Switzerland). For UV-C irradiation (6000 ergs) 2 week-old seedlings were irradiated with light 
provided by a HNS 55W OFR lamp (Osram). After treatment, plants were harvested at 
several time points (30min, 1h, 4h and 12h) and RNA extracted as described above. Then 
semi-quantitative RT-PCR analysis was performed with the following primers Atlno80 
(TGATGGATCTATCACCATCAG, SEQ ID NO:23; ggtgggattccaatcactttc, SEQ ID NO:24) 
AtRvbl (tttgatgggccaaatgatg, SEQ ID NO:25; cttccaaCCTAGGtgagatgtttcaacaaaatgtgc, 
SEQ ID NO:26) AtRvb21 (tcaacagcaggacacaagg, SEQ ID NO:27; 
cccaatgCCTAGGaaatccgagttcaacatcctaatc, SEQ ID NO:28) AtRvb22 
(acaaaccagatatcagcacatgg, SEQ ID NO:29; aacaagtactcgctctcatgctc, SEQ ID NO:30). 

To characterize further the AtinoSO-l HR deficiency, we subjected the mutant to various 
genotoxic stresses. In parallel with the original ino80 mutant we also tested two allelic 
mutants of /A/OS0 from the publicly available SAIL mutant collection. Neither bleomycin nor 
Mitomycin-C or UV-C sensitivity was shifted fn the Atino80-1 mutants, in any of the various 
conditions tested. All the Atino80-A alleles seem to be slightly hypersensitive to MMS (methyl 
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methanesulfonate), which is also known to induce DNA double-strand breaks. The difference 
in sensitivity is visible at 60 and 80ppm of MMS on root elongation and rosette growth. Most 
mutations affecting DNA repair or recombination also give rise to changes in the sensitivity 
to DNA damaging agents. We challenged Atino8Q-'\ mutant plants with various treatments 
known to induce DNA damage and recombination. None of the tested agents (UV-C, 
bleomycin, Mitomycin-C and MMS) gave rise to a shift in sensitivity, with the exception of a 
slight change for MMS. This suggests a difference for the role of INO80 in plants compared 
to yeast (Ebbert et al., 1999; Shen et al. 2000) and supports the use of AHNO80 to regulate 
homologous recombination without affecting the major repair pathway in plants. 

In the sm22 background the steady state level of AtRvb21 and AtRvb22 was shown to be 
down-regulated using RT-PCR on RNA extracted as above mentioned. 
This indicates that the components of the putative Arabidopsis INO80 complex show co- 
regulation at the transcriptional level, supporting the use of Arabidopsis Rvb1, Rvb21 and 
Rvb22 and the Arabidopsis Arp protein orthologs to manipulate homologous recombination 
frequency in plants. 

Example 2: AtRvbl as positive regulator of homologous recombination. 

As describe above (Example 1), the original recombination-up phenotype found in sm22 can 
be associated with an effect mediated by the Arabidopsis Rvb1 and 2 orthologs. Thus, 
AtRvbl can be used as a positive regulator of homologous recombination. 

Example 3: AtRvb21 as positive regulator of homologous recombination. 

As describe above (Example 1), the original recombination-up phenotype found in sm22 can 
be associated with an effect mediated by the Arabidopsis Rvb1 and 2 orthologs. Thus, 
AtRvb21 can be used as a positive regulator of homologous recombination. 

Example 4: AtRvb22 as positive regulator of homologous recombination. 

As describe above (Example 1), the original recombination-up phenotype found in sm22 can 
be associated with an effect mediated by the Arabidopsis Rvb1 and 2 orthologs. Thus, 
AtRvb22 can be used as a positive regulator of homologous recombination. 

Example 5: At3g57290 as positive regulator of homologous recombination. 



WO 2004/003013 



PCT/EP2003/006757 



-22- 

ln the sm22 mutant (Example 5), the At3g57290p gene is potentially overexpressed by the 
35S Enhancer/promoter. Over expression of this gene in the sm22 context or directly with a 
35S promoter can be carried out to reproduce the original recombination-up phenotype. The 
phenotype was lost in the second generation (Example 1), at which point At3g57290 is not 
overexpressed any longer allowing a temporal ability to modulate homologous 
recombination. 

Example 6: AtArp as positive regu)ators of homologous recombination. 

As describe above (Example 1), the original recombination-up phenotype found in sm22 can 
be associated with an effect mediated by other components of the Arabidopsis JNO80 
complex, such as the Arp homolog AtArp5.1, AtArp5.2, AtArp5.3 and/or AtArp8. Any of 
these Arp hmologues can be used as a positive regulator of homologous recombination, 
alone or in combination. 

All publications referred to herein as well as the disclosure of GB patent application 
0214896.3 are incorporated by reference as if each is referred to individually. 
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What is claimed is: 

1. An isolated nucleic acid comprising a sequence having 98.5% or more identity with 
the sequence depicted in SEQ ID NO:1. 

2. The nucleic acid of claim 1 , wherein said nucleic acid is DNA. 

3. A vector comprising the nucleic acid if claim 2. 

4. A host cell comprising the vector or nucleic acid of claim 3. 

5. A polypeptide encoded by the isolated nucleic acid of claim 1 . 

6. A method for inducing homologous recombination in a cell, said method comprising 
modulating the expression or properties of one or more gene products selected 
from the group consisting of Atlno80, At3g57300, Rvbi, Rvb21, Rvb22, Arp5 and 
Arp8, fragments, derivatives and homologues thereof,. 

7. The method of claim 11, said method comprising increasing expression of said 
gene product. 

8. The method of claim 12, said method comprising introducing a nucleic acid 
encoding said gene product into said cell operably linked to a promoter; and 
allowing transcription and translation of said gene in an amount sufficient to affect 
homologous recombination in said cell. 

9. The method of claim 13, wherein said homologous recombination is somatic 
homologous recombination. 

10. The method of claim 13, wherein said homologous recombination is meiotic 
homologous recombination. 

1 1 . The method of claim 13, wherein said promoter is an inducible promoter. 

12. The method of claim 13, wherein said promoter is a tissue-specific promoter. 

13. The method of claim 13, wherein said promoter is a constitutive promoter. 
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14. The method of claim 13, wherein said promoter is a meiosis-specific promoter. 

15. A method of increasing gene targeting to a desired locus in a host cell, said 
method comprising introducing a desired gene into a host cell, modulating the 
expression or properties of one or more gene products selected from the group 
consisting of Atlno80, At3g57300, Rvb1, Rvb21, Rvb22, Arp5 and Arp8 or 
functional fragments, derivatives and homologues thereof in said host cell, and 
detecting integration of said desired gene at a selected locus in the genome of said 
host cell. 
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Figure 1 

>Alignment of Atlno 80 sequence and public sequence, At3g57300, showing splicing 
difference 

Query: claimed sequence 

Sbjct gi|1 841 0689|ref |NM_1 1 5590.1 1 (AGI:At3g57300) 



Query: 1 



atggatcctfccaagacgaccaccgaaggactctccttacgcgaatctattcgatctcgag 60 

l j?Yl llllllllTl 1TI llll IT l ITTI B 1 1 1 ■ 1 1 ■ ■ 1 1 IT 1 1 1 1 1 1 1 1 IT1 1- 



Sbjct: 1 atggatccttcaagacgaccaccgaaggactctccttacgcgaatctattcgatctcgag 60 



Query: 61 



ccattgatgaagtttagaattccgaaacctgaagatgaagttgattattatgggagtagt 120 

t lTi iTi i?l iTl 1 1 lTi ill 1 ■ 1 1 1 1 II 1 1 ■ 1TI ITI iTi »T I ■ 1 1 1 ■ 1 1 Ti 1 1 1 1 1 1 1 19n 

Sbjct: 61 ccgttgatgaagtttagaattccgaaacctgaagatgaagttgattattatgggagtagt 120 
Query: 121 agccaggatgaaagtagaagcactcaaggtggggtagtggcaaactacagcaatgggt^ 180 

III I ITTI IT! I 111 ITI 111 1 1 11 1 ITT ITITT J ITITTI I > I * » I » iTjll ' ' HIiJLi i.« 

Sbjct: 121 agccaggatgaaagtagaagcactcaaggtggggtagtggcaaactacagcaatgggtct 180 



Query: 
Sbjct: 



181 



Query: 241 
Sbjct: 241 
Query: 301 
Sbjct: 301 
Query: 361 
Sbjct: 361 
Query: 421 
Sbjct: 421 
Query: 481 
Sbjct: 481 
Query: 541 
Sbjct: 541 
Query: 601 
Sbjct: 601 
Query: 661 
Sbjct: 661 
Query: 721 
Sbjct: 721 
Query: 781 
Sbjct: 781 
Query: 841 
Sbjct: 841 
Query: 901 
Sbjct: 901 
Query: 961 



aaat< 

ill 



TTTTm?TTi?m?TiTTTT?TTTiTTT!??H 



atgca 240 



181 aaatcgagaatgaatgcgagctccaagaagagaaagcggtggacagaagctgaggatgca 240 



gaggacgatgatgatctctacaatcaacatgttactgaggagcactaccgatcaatgctt 3 00 

TiTTl ITI ITI ITI 1 1 1 1 1 1 1 1 1 J 1 1 1 1 1 J 1 1 1 1 IIIITIIIIIIJIIITMI ■ IIM* * , M 

gaggacgatgatgatctctacaatcaacatgttactgaggagcactaccgatcaatgctt 300 
ggggagcatgtacaaaaattcaaaaataggtccaaggagactcaagggaatcctcctcat 360 

TiiiiTimmiiimiimiiimimiiTm _ o 

ggggagcatgtacaaaaattcaaaaataggtccaaggagactcaagggaatcc tec teat 360 



420 



TTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTfTTTTTTTTTiTT 4s o 

ctgatgggttttccggtgctaaagagcaatgtgggcagttacagaggtaggaaaccaggg 420 
aatgattaccatgggaggttctatgacatggacaactctccaaattttgcagctgatgtg 480 

1 1 iTi 1 1 1 1 1 1 iTTTitTi 1 1 1 1 iTi 1 1 lTTi 1 1 1 ii it i 1 1 1 iii i ■ i i i ■ 1 1 1 » J. i±X 4fl0 

aatgattaccatgggaggttctatgacatggacaactctccaaattttgcagctgatgtg 480 
accccacataggccaggaagctaccatgatcgtgatattacacccaagatagcatatgaa 540 

iiiiiiiiiiniTiTTiiTiiiiiiiiiiiT ii mi i ii i iii minimi in 5 , 0 

accccacataggcgaggaagctaccatgatcgtgatattacacccaagatagcatatgaa 540 
ccttcgtatttggacattggtgatggtgtcatctacaaaatccccccaagttatgacaag 600 

lliiiifillliil llllii iiilTiiiiliiliiiiil llll lillilTlll IT 1 1 1 1 T soo 

ccttcgtatttggacattggtgatggtgtcatctacaaaatccccccaagttatgacaag 600 
ctggtggcatcattaaacttaccgagcttttcagacattcatgtggaagaattttacttg 660 

1 1 ill iilillltlllllli iliTiillliil llll I T l TT I ■ 1 1 1 il it 1 1 1 IT GS0 

ctggtggcatcattaaacttaccgagcttttcagacattcatgtggaagaattttacttg 660 
aaaggaactctggatctgagatcattagcagaactgatggcaagtgataaaaggtctgga 720 

lllllllllllTTllllllllllllllTllllllllllllllllllllllllllll Til 7?o 

aaaggaactctggatctgagatcattagcagaactgatggcaagtgataaaaggtctgga 720 
gtaagaagccgtaatggaatgggtgagcctcgacctcaatatgaatctcttcaagctaga 780 

Tl I ITI ITI ITI i I ITTI I ITTTITI 1 1 1 1 TTl 1 II 1 1 1 1 1 111 I III II 1 1 llll I 111 

gtaagaagccgtaatggaatgggtgagcctcgacctcaatatgaatctcttcaagctaga 780 



atgaaggccctgtcaccttcaaactcc^ 840 
atgaaggccctgtcaccttcaaactccaccccaaattttagcctcaaggtgtcagaagct 840 



fcaatgaattctgccattccagaaggatctgctggaagtactgcacggacaattctgtct 900 
llllflUlllflllllllllllTTllllMllTlllllllTMlTTllllllllllll 
gcaatgaattctgccattccagaaggatctgctggaagtactgcacggacaattctgtct 900 

mwnniiMifflm^ 

gagggtggtgttttacaggtccattacgtgaagattctggagaagggggatacatacgag st>u 
attgttaaacgaagtctaccgaagaagctgaaagcaaagaatgatcctgcagtcattgag 1020 
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iiiimiiiiimiiiiiiimiiiimiiiiimmimiiiinmiiii lmm 

Sbjct: 961 attgttaaacgaagtctaccgaagaagctgaaagcaaagaatgatcctgcagtcattgag 1020 

Query: 1021 ttttt?tttttttttttttttttttttttttttttttt?t?tttt?tttttttttttttt 21 

Sbjct: 1021 aaaacagaaagggataaaattagaaaagcctggatcaatattgtcagaagagatatagca 1080 
Query: 1081 aaacaccatagaattttcactacttttcatcgtaaactatcaattgatgccaagaggttt 1140 

IIIIIIIIIITIIIIIIIIIIIIIIIIIIIITIIIIIIIIIIIIIIIIIIIIITITIIII nijin 

Sbjct: 1081 aaacaccatagaattttcactacttttcatcgtaaactatcaattgatgccaagaggttt 1140 

Ouery: 1141 gcagatggt egcc ffififfl^ffi^ ffl TT??TTTmTTTTT?TTT J!™ 
Sbjct: 1141 gcagatggttgccaaagagaggtgagaatgaaggtgggtagatcatacaaaatcccaaga 1200 
Query: 1201 actgcaccaattcgcactaggaagatatccagaaacatgctgctattctggaagcgatat 1260 

II1TIIII1111 1TI llIllTllTllllIllTlTllllTllTlllllllTll ll ITIIII „ co 

Sbjct: 1201 actgcaccaattcgcactaggaagatatccagagacatgctgctattctggaagcgatat 1260 

Query: 1261 TTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT " 2 ° 

Sbjct: 1261 gacaagcagatggcagaagagaggaaaaagcaagaaaaggaagctgcagaggcttttaaa 1320 

Q ~ y * 1321 TTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT 

Sbjct: 1321 cgtgaacaggagcagcgagagtcaaaaaggcagcaacaaaggctcaatttccttattaaa 1380 
Query: 1381 cagactgagctttacagtcacttcatgcaaaacaagaccgattcgaatccttccgaagcc 1440 

llTlllTlTlllllllTllllllllllllllllllllllTlMlllllllllllTllTll 14an 

Sbjct: 1381 cagactgagctttacagtcacttcatgcaaaacaagaccgattcgaatccttccgaagcc 1440 

Q "" sy * 1441 TT?TTTTTTTTTTTTTTTTTTT7TTTTTTTTTT?TTTTTTT?TTTTTTTTTTT?TTTTTT 

Sbjct: 1441 ttaccaataggtgatgaaaatccgattgacgaagtgctcccagaaacttcagcggcagaa 1500 

Cuery: 1501 ffiffififlfffiffiffi ?f TTTTTTTTTTTT ^ 
Sbjct: 1501 ccttctgaggtagaggatcctgaagaggctgaactgaaggaaaaggtcttgagagctgcc 1560 

Query: 1561 caagatgcggtgtctaagcagaagcaaataacagatgcatttgacactgaatatatgaag 1620 

1 1 iTl 1T1 TT IT I II1II1ITIIIIIIIII11I 111 IHIIlTllltJI 1111 1 1 IT 1 1 1 ie , n 

Sbjct: 1561 caagatgcggtgtctaagcagaagcaaataacagatgcatttgacactgaatatatgaag 1620 
Query: 1621 ctacgccaaacttctgaaatggaaggtcctttaaatgatatatcagtttctggctcgagc 1680 

1 1 1 iTl 1 1 11 ! 1 1 1 1 1 1 1 1 1 TT 1 1 Tl 1 1 1 1 1 1 ! i 1 1 1 1 1 1 1 1 1 1 IT 1 1 1 1 ITT I lllll icnn 

Sbjct: 1621 ctacgccaaacttctgaaatggaaggtcctttaaatgatatatcagtttctggctcgagc 1680 
Query: 1681 aatatagatttgcataacccatctacaatgcctgttacatcaacagttcagactccagag 1740 

llllllTllllTllillllllllllllllllllllllllllllllTlllllllllllllT „ jA 

Sbjct: 1681 aatatagatttgcataacccatctacaatgcctgttacatcaacagttcagactccagag 1740 
Query: 1741 ttatttaaaggaacccttaaagaataccaaatgaaaggccttcagtggctagtcaattgt 1800 

^ illlllllllllllllMlllTllllllllllllllltlllllll I II II lllll M II iQnn 

Sbjct: 1741 ttatttaaaggaacccttaaagaataccaaatgaaaggccttcagtggctagtcaattgt 1800 
Query: 1801 tatgagcagggtttgaatggcatacttgctgatgaaatgggcttgggtaagactattcaa 1860 

I I iTlTl ITTT 1 11T1I1I111I1I1IIIIIIII1I.1 11T1 1 1 1 ITT1 Illlllllllll locn 

Sbjct: 1801 tatgagcagggtttgaatggcatacttgctgatgaaatgggcttgggtaagactattcaa 1860 
Query: 1861 gctatggcgttcttggcacatttggctgaggaaaagaacatttggggtccatttcttgtt 1920 

Yi 1 1 iTt it ■ i iiiTiiiiiii iTT 1 1 1 iTT i iiiiiiiiiiiiiTiii inn mm . 

Sbjct: 1861 gctatggcgttcttggcacatttggctgaggaaaagaacatttggggtccatttcttgtt 1920 
Query: 1921 gttgcccctgcctctgttcttaacaattgggctgatgaaatcagtcgtttctgtcctgac 1980 

Ti i T 1 1 1 1 1 T i f f i i 1 1 1 1 f f 1 1 1 1 1 f f TT fiiTfiiriiiiiTtiTiffii T 1 1 1 1 T i I ioon 

Sbjct: 1921 gttgcccctgcctctgttcttaacaattgggctgatgaaatcagtcgtttctgtcctgac 1980 
Query: 1981 ttgaaaactcttccatattggggaggattacaagaacgaacaattttaagaaagaatatc 2040 

MTI III II IIIIIIIIIIIIITIII II II II I 111 lllll 1 1 II II I 111 I lllll I II • 

Sbjct: 1981 ttgaaaactcttccatattggggaggattacaagaacgaacaattttaagaaagaatatc 2040 
Query: 2041 aatcccaagcgtatgtaccgaagggatgctggctttcatattttgattactagctatcag 2100 

iii ii ii ii nil ill i ii ti i nil ii i inn ii iii ii ii an 1 1 ii ii in i ii i oinn 

Sbjct: 2041 aatcccaagcgtatgtaccgaagggatgctggctttcatattttgattactagctatcag 2100 

Query: 2101 ctattagtcactgatgaaaagtattttcgccgggtgaagtggcaatatatggtgctagat 2160 

III IIITII II II II ll I II II II Hl^ ^ n 
Sbjct: 2101 ctattagtcactgatgaaaagtattttcgccgggtgaagtggcaatatatggtgctagat 2160 

Ouerv: 2161 gaggcccaagcaatcaagagttcctccagtataagatggaaaacccttcttagttttaac 2220 

i iT ii 1 1 1 IT 1 1 1 1 1 1 1 1 1 T 1 1 1 1 1 1 1 1 T 1 1 1 1 1 T l mm mi ii illlllllllll 
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Sbjct: 2161 gaggcccaagcaatcaagagttcctccagtataagatggaaaacccttcttagttttaac 2220 
tgtcggaaccgattgcttctgactggtactccaattcagaacaacatggcagagttatgg 2280 

B 1 1 ITT IIIITIIITIIIIITII 1TTI lllltllllllTllllIll iTT ■ IT IT 1 1 1 1 T T 

tgtcggaaccgattgcttctgactggtactccaattcagaacaacatggcagagttatgg 2280 
gccctgctgcatttcatcatgccaatgttgtttgacaaccatgatcaatttaatgaatgg 2340 

, TllllTllllllllllllllllllllllillllTllllllllTlllllllllllllllTl 

Sb^ct: 2281 gccctgctgcatttcatcatgccaatgttgtttgacaaccatgatcaatttaatgaatgg 2340 
ttctcaaaaggaattgagaatcatgctgaacacggaggcactttaaatgagcaccagctt 2400 

llllllllllllllllllllllllTllllilllTllTTlllllllllllilliilfllll 

ttctcaaaaggaattgagaatcatgctgaacacggaggcactttaaatgagcaccagctt 2400 
aacagactgcatgcga tc t tgaaaccgt teat get tcgacgggtaaaaaaggatgtggt t 24 60 

, llllllllllllllllllllTlllllllllll.llllllllTlTlllllllTllllllTll 

2401 aacagactgcatgcgatcttgaaaccgttcatgcttcgacgggtaaaaaaggatgtggtt 2460 
tctgagctaactacaaagacggaagttacagtacactgcaagctcagttctcgacaacaa 2520 

lllllllllltllllllTllllllTllllllllllllTlllTllllTlllllllllllll 

tctgagctaactacaaagacggaagttacagtacactgcaagctcagttctegacaaeaa 2520 
Query: 2521 gctttttatcaggctattaagaacaaaatttctctggctgagttgtttgatagcaaccgc 2580 

lllllllllllTllllllllTlllllllllllllllTlllllllllllTllllllllill 

gctttttatcaggctattaagaacaaaatttctctggctgagttgtttgatagcaaccgc 2580 
ggacaatttactgataagaaagtattgaatttaatgaa tat tg teat tcaactaaggaag 2640 

TTllllllllllTllllTllllllllTlllllllllllllllllllllllllllllini 

ggacaatttactgataagaaagtattgaatttaatgaatattgtcattcaactaaggaag 2640 
gtttgcaaccatccagagttgttcgaaaggaatgaagggagctcgtatctctactttgga 2700 

lllllllllllllllTlTllllllllllTlllllllllTllllllllllllllllllllI 

Sbjct: 2641 gtttgcaaccatccagagttgttcgaaaggaatgaagggagctcgtatctctactttgga 2700 
gtgacttccaattctcttttgccccatccctttggtgagctagaggatgtacattattct 2760 

lllllllllllllllllllllllllllllllllTTllUllllllTllTllillllllll 

Sbjct: 2701 gtgacttccaattctcttttgccccatccctttggtgagctagaggatgtacattattct 2760 
ggtggtcaaaatccgataatatacaagatacctaagctactacaccaagaggtgctccaa 2820 

IIIIIIIIIIIIIITIIIIIIIIIIITIIIIIIIIIIIIIIIIIIIIITIIIIIIIIIII 

2761 ggtggtcaaaatccgataatatacaagatacctaagctactacaccaagaggtgctccaa 2820 
aattctgaaacattttgttcttctgtcgggcgtggcatctcaagagaatcttttctgaag 2880 

llllllTlllllllllTllllllllllllTlllllllllllllllTllllllllllTllT 

2821 aattctgaaacattttgttcttctgtcgggcgtggcatctcaagagaatcttttctgaag 2880 
2881 cattttaatatatattcacctgagtatattcttaagtcaatattcccatctgatagtggg 2940 

„ lllllllllllllllllllllTlllllllllllllTllllllllllllllllllllllll 

2881 cattttaatatatattcacctgagtatattcttaagtcaatattcccatctgatagtggg 2940 
gtagatcaagtggttagtggaagtggagcatttggcttttcacgcttgatggatctatca 3000 

TllIlllllTllllllllllllTllll[llllllllll!lll)l!ll!llllllllilll 

2941 gtagatcaagtggttagtggaagtggagcatttggcttttcacgcttgatggatctatca 3000 
ccatcagaagttggatatctggctctgtgttctgttgcagaaaggctattattttctata 3060 

illlliTllllllTilllllllllllTlillllllllllTllllllllllllllllllll 

3001 ccatcagaagttggatatctggctctgtgttctgttgcagaaaggctattattttctata 3060 



Sbjct: 


2161 


Query: 


2221 


Sbjct: 


2221 


Query: 


2281 


Sbjct: 


2281 


Query: 


2341 


Sbjct: 


2341 


Query: 


2401 


Sbjct: 


2401 


Query: 


2461 


Sbjct: 


2461 


Query: 


2521 


Sbjct : 




Query: 


2581 


Sbjct: 


2581 


Query: 


2641 


Sbjct: 


2641 


Query: 


2701 


Sbjct: 


2701 


Query: 


2761 


Sbjct : 


2761 


Query: 


2821 


Sbjct: 


2821 


Query: 


2881 


Sbjct: 


2881 


Query: 


2941 


Sbjct: 


2941 


Query: 


3001 


Sbjct: 


3001 


Query: 


3061 


Sbjct: 


3061 


Query: 


3121 


Sbjct: 


3121 


Query: 


3181 


Sbjct: 


3181 


Query: 


3241 



fat 3120 



tgaggtgggagcggcaa 1 1 1 1 1 gga tgaat tagt taac tc tc 1 1 atggagtccaagga t 

iiiiimiiTiiiiiiiiiimim 



gg tga t c t tag tgacaa t aaca tcgagagagt taaaaccaaage tgtcacaagaa tgt tg 3180 

iTlTllllllTlTlllliiiiiiiiiTlTlllllllllllllllllllllllllllillT 

Sbjct: 3121 ggtgatcttagtgacaataacatcgagagagttaaaaccaaagctgtcacaagaatgttg 3180 
ctgatgccatcaaaagttgaaacgaattttcagaaaaggagactaagcacagggcctacc 3240 

„ S1 liTllllllllllllTlllllllllillllllTlllllllllllllllllllMIIIIII 

3181 ctgatgccatcaaaagttgaaacgaattttcagaaaaggagactaagcacagggcctacc 3240 
Query: 3241 cgtccttcatttgaagcgctagtgatctctcatcaggataggtttctttcaagtatcaaa 3300 
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llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
cgtccttcatttgaagcgctagtgatctctcatcaggataggtttctttcaagtatcaaa 3300 

3301 ctcc tgcat tc tgcatatac 1 1 at atcccaaaagccagagc tccacc tgtaagcat teat 3360 

„«, lllllllllllllllllllllllllllllllllllllllTllllllllTlllllllllll 

Sbjct: 3301 ctcctgcattctgcatatacttatatcccaaaagccagagctccacctgtaagcattcat 3360 
3361 tgctcggacagaaattcggcatacagagttacagaagaattacatcaaccatggcttaag 3420 

, „ llll llilll Mill llTTllllllllllillllllilllllllllllllllTTllllll 

33 61 tgctcggacagaaattcggcatacagagttacagaagaattacatcaaccatggcttaag 3420 
agactattaatcggttttgcacgaacgtcagaagctaatggacccaggaagcctaacagc 3480 

ITIIIIIIIIIITIIIIIIIIIIIIIIIIIIIITIIIIIITIIIIIIIIITIIIIIIIII 

agactattaatcggttttgcacgaacgtcagaagctaatggacccaggaagcctaacagc 3480 
tttccacatcctttaatccaagaaattgattcagaacttccagttgtgcagcctgcgctt 3540 

IIIIIIIIIIIIIIIIIIIIITIIIIHIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

3481 tttccacatcctttaatccaagaaattgattcagaacttccagttgtgcagcctgcgctt 3540 
caactgacacacagaatatttggttcttgccctccaatgcaaagttttgacccagcaaag 3600 

lllllllllllllIllllllllllllllTlllllllllTlllllllllTlllllllllll 

caactgacacacagaatatttggttcttgccctccaatgcaaagttttgacccagcaaag 3600 
ttgctcacggactctgggaagctgcagacacttgatatattattgaagcggcttcgagct 3660 

llTlllllllllllllTTllllllllllllilllllllllllllTllllllllllillll 



Sbjct: 


3241 


Query: 


3301 


Sbjct: 


3301 


Query: 


3361 


Sbjct: 


3361 


Query: 


3421 


Sbjct: 


3421 


Query: 


3481 


Sbjct: 


3481 


Query: 


3541 


Sbjct: 


3541 


Query: 


3601 


Sbjct: 


3601 


Query: 


3661 


Sbjct: 


3661 


Query: 


3721 


Sbjct: 


3721 


Query: 


3781 


Sbjct: 


3781 


Query: 


3841 


Sbjct: 


3841 


Query: 


3901 


Sbjct: 


3901 


Query: 


3961 


Sbjct: 


3961 



ggaaatcacagggtgctcctgtttgcacaaatgacaaagatgctgaacattctcgaggat 3720 

T T 1 1 ■ 1 1 1 1 1 T T T ITII lllTlllTlllllllTllllll Ill mil 

3661 ggaaatcacagggtgctcctgtttgcacaaatgacaaagatgctgaacattctcgaggat 3720 
Query: 3721 tatatgaactatagaaagtacaagtacctcaggcttgatggatcctccaccatcatggat 3780 

m mi 1 1 m 1 1 1 1 1 1 1 1 1 mi i m 1 1 mmii m i m 1 1 1 m 1 1 mill i 



cgccgagatatggttagggattttcagcataggagcgatatttttgtattcttgctgagc 3840 

^ 1 1 1 J 1 1 1 1 1 1 1 1 T 1 1 1 ITT 1 1 1 f 1 1 1 T 1 1 i I TT iTtTllIIIlllfifllllfTtlTjTl 

Sbjct: 3781 cgccgagatatggttagggattttcagcataggagcgatatttttgtattcttgctgagc 3840 
accagagctggaggacttggtatcaacttgacggctgcagacactgtcattttctatgaa 3900 

^ iiiiTiTiiTiiiTiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii 

3841 accagagctggaggacttggtatcaacttgacggctgcagacactgtcattttctatgaa 3900 
agtgattggaatcccaccttggatttacaagctatggacagggctcatcgtcttggacag 3960 

, onl llllllllTllinilllilllllllllllllllllllllTllllllllTllllTllIlT 

3901 agtgattggaatcccaccttggatttacaagctatggacagggctcatcgtcttggacag 3960 



iiffi? 



Score « 1001 bits (505), Expect = 0.0 
Identities = 522/528 (98%), Gaps = 6/528 (1%) 
Strand = Plus / Plus 

Query: 3997 gagacggtggaagagaaaattttgcacagggcaagtcagaaaaatacagttcaacagctt 4056 

cw, ,o-,o iiiiiiiiiiiiiiiniiiiiiiiiiimiiiiiiiiiiiiiiiiiiiiiiiiiiiii 

Sbjct: 3970 gagacggtggaagagaaaattttgcacagggcaagtcagaaaaatacagttcaacagctt 4029 
Query: 4057 gttatgactggagggcatgttcagggtgatgattttcttggagctgcggatgtggtatct 4116 

cw An , n Tl ■ I ITI 1 J ITITTTI 1 1 1 1 1 1 ITTI IIIITIIIIIIJIITITIITITIIIIIITIIIII 

Sbjct: 4030 gttatgactggagggcatgttcagggtgatgattttcttggagctgcggatgtggtatct 4089 
Query: 4117 ctgctaatggatgatgcggaggcagcacaactggagcagaaattcagagaactaccatta 4176 
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lllllllllllllllllllllllllllllllllllllllllllllllllllllllll 

c taatggatgatgcggaggcagcacaac tggagcagaaat tcagagaac taccat ta 4149 
g taaaggacaggcagaagaaaaagacgaaacgtatcagaatagatgctgaaggagat 4236 

...T lllTTllTIITIIIIlTllTlillllllllTllinilTllTllllllll 4on , 

Sbjct: 4150 cagg acaggcagaagaaaaagacgaaacgtatcagaatagatgctgaaggagat 4203 

gcaactttggaagagttagaagatgttgaccgacaggataacggacaggaacctttcgaa 4296 

1 1 1 1 i 1 1 1 T T 1 lllTl 1 1TI iTi ITI IT T T 1 1 1 1 1 TT 1 1 1 T7 1 1 1 1 1 1 1 TT 1 1 ^ 

gcaactttggaagagttagaagatgttgaccgacaggataacggacaggaacctttggaa 4263 
gaaccggaaaagccaaaatccagtaataaaaagaggagagctgcttcaaatccgaaagct 4355 

TIIIITTIIIIIIIIIIIIIIITIIII TllllTlTllllllMllllllllllll 4 „, 

Sbjct: 4264 gaaccggaaaagccaaaatccagtaataaaaagaggagagctgcttcaaatccgaaagct 4323 
agagctcctcagaaagcaaaggaagaagcaaatggtgaagatactcctcagacrgacaaaa 4416 

iTlTlllllllTlllTllllTllllllllllllllllllTllllllllllllTTllllll 

agagctcctcagaaagcaaaggaagaagcaaatggtgaagatactcctcagaggacaaaa 4383 
agggtaaagagacaaacaaagagcataaacgaaagtcttgaacctgtattctctgcctct 4476 

inTllllflTlllllllllIlllllllllllllllllllllllllllNllllTlllll AM 

4384 agggtaaagagacaaacaaagagcataaacgaaagtcttgaacctgtattctctgcctct 4443 

?taacagaatcaaataaaggattcgatccaagtagctccgctaactaa 4524 
lllllTllllllllllillllllllllllUllTllllTllllllll tm 

Sbjct: 4444 gtaacagaatcaaataaaggattcgatccaagtagctccgctaactaa 4491 



Sbjct: 


4090 


Query: 
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4150 
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4237 


Sbjct: 


4204 
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4297 


Sbjct: 


4264 


Query: 


4357 


Sbjct: 


4324 
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4417 
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4384 
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SEQUENCE LISTING 



<110> Novartis Forschungsstiftung, zweigniederlassung Friedrich Miescher 
institute for Biomedical Research 



<120> Gene for increased somatic recombination 

<130> 1-32546B 

<160> 30 

<170> Patentin version 3.1 

<210> 1 

<211> 4524 

<212> DNA 

<213> Arabidopsis thai i ana 
<220> 

<221> CDS 

<222> (1)..(4524) 

<223> 



<400> 1 

atg gat cct tea aga cga cca ccg aag gac tct cct tac gcg aat eta 48 
Met Asp Pro ser Arg Arg Pro Pro Lys Asp Ser Pro Tyr Ala Asn Leu 
1 5 10 15 

ttc gat etc gag ccg ttg atg aag ttt aga att ccg aaa cct gaa gat 96 
Phe Asp Leu Glu Pro Leu Met Lys Phe Arg lie Pro Lys Pro Glu Asp 
20 25 30 

gaa gtt gat tat tat ggg agt agt age cag gat gaa agt aga age act 144 
Glu Val Asp Tyr Tyr Gly Ser Ser Ser Gin Asp Glu Ser Arg ser Thr 
35 40 45 

caa ggt ggg gta gtg gca aac tac age aat ggg tct aaa teg aga atg 192 
Gin Gly Gly Val val Ala Asn Tyr Ser Asn Gly Ser Lys ser Arg Met 
50 55 60 

aat gcg age tec aag aag aga aag egg tgg aca gaa get gag gat gca 240 
Asn Ala Ser Ser Lys Lys Arg Lys Arg Trp Thr Glu Ala Glu Asp Ala 
65 70 75 80 
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gag gac gat gat gat etc tac aat caa cat gtt act gag gag cac tac 288 
Glu Asp Asp Asp Asp Leu Tyr Asn Gin His val Thr Glu Glu His Tyr 
85 90 95 

cga tea atg ctt ggg gag cat gta caa aaa ttc aaa aat agg tec aag 336 
Arg ser Met Leu Gly Glu His val Gin Lys Phe Lys Asn Arg ser Lys 
100 105 110 



gag act caa ggg aat cct cct cat ctg atg ggt ttt ccg gtg eta aag 
Glu Thr Gin Gly Asn Pro Pro His Leu Met Gly Phe Pro Val Leu Lys 
115 120 125 



999 agg ttc tat gac atg gac aac tct cca aat ttt gca get gat gtg 
Gly Arg Phe Tyr Asp Met Asp Asn Ser Pro Asn Phe Ala Al.a Asp Val 
145 150 155 160 



384 



age aat gtg gqc agt tac aga ggt agg aaa cca ggg aat gat tac cat 432 
ser Asn val Gly Ser Tyr Arg Gly Arg Lys Pro Gly Asn Asp Tyr His 
130 135 140 



480 



acc cca cat agg cga gga age tac cat gat cgt gat att aca ccc aag 528 
Thr Pro His Arg Arg Gly Ser Tyr His Asp Arg Asp lie Thr Pro Lys 
165 170 175 

ata gca tat gaa cct teg tat ttg gac att ggt gat ggt gtc ate tac 576 
lie Ala Tyr Glu Pro Ser Tyr Leu Asp lie Gly Asp Gly Val He Tyr 
180 185 190 

aaa ate ccc cca agt tat gac aag ctg gtg gca tea tta aac tta ccg 624 
Lys lie Pro Pro Ser Tyr Asp Lys Leu Val Ala Ser Leu Asn Leu Pro 
195 200 205 

age ttt tea gac att cat gtg gaa gaa ttt tac ttg aaa gga act ctg 672 
Ser Phe Ser Asp lie His val Glu Glu Phe Tyr Leu Lys Gly Thr Leu 
210 215 220 

gat ctg aga tea tta gca gaa ctg atg gca agt gat aaa agg tct gga 720 
Asp Leu Arg ser Leu Ala Glu Leu Met Ala Ser Asp Lys Arg Ser Gly 
225 230 235 240 

gta aga age cgt aat gga atg ggt gag cct cga cct caa tat gaa tct 
val Arg Ser Arg Asn Gly Met Gly Glu Pro Arg Pro Gin Tyr Glu ser 
245 250 255 



768 



ctt caa get aga atg aag gee ctg tea cct tea aac tec acc cca aat 816 

Leu Gin Ala Arg Met Lys Ala Leu ser Pro Ser Asn Ser Thr Pro Asn 
260 265 270 

ttt age etc aag gtg tea gaa get gca atg aat tct gee att cca gaa 864 

Phe Ser Leu Lys val ser Glu Ala Ala Met Asn Ser Ala lie Pro Glu 

275 280 285 

gga tct get gga agt act gca egg aca att ctg tct gag ggt ggt gtt 912 

Gly ser Ala Gly ser Thr Ala Arg Thr He Leu ser Glu Gly Gly Val 

290 295 300 

tta cag gtc cat tac gtg aag att ctg gag aag ggg gat aca tac gag 960 

Leu Gin val His Tyr val Lys lie Leu Glu Lys Gly Asp Thr Tyr Glu 

305 310 315 320 

att gtt aaa cga agt eta ccg aag aag ctg aaa gca aag aat gat cct 1008 

lie Val Lys Arg ser Leu Pro Lys Lys Leu Lys Ala Lys Asn Asp Pro 
325 330 335 

gca gtc att gag aaa aca gaa agg gat aaa att aga aaa gee tgg ate 1056 

Ala val lie Glu Lys Thr Glu Arg Asp Lys lie Arg Lys Ala Trp lie 
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340 345 350 

aat att gtc aga aga gat ata gca aaa cac cat aga att ttc act act 1104 
Asn lie val Arg Arg Asp He Ala Lys His His Arg lie Phe Thr Thr 
355 360 365 

ttt cat cgt aaa eta tea att gat gec aag agg ttt gca gat ggt tgc 1152 
Phe His Arg Lys Leu ser lie Asp Ala Lys Arg Phe Ala Asp Gly cys 
370 375 380 

caa aga gag gtg aga atg aag gtg ggt aga tea tac aaa ate cca aga 1200 
Gin Arg Glu Val Arg Met Lys val Gly Arg ser Tyr Lys lie Pro Arg 
385 390 395 400 

act gca cca att cgc act agg aag ata tec aga gac atg ctg eta ttc 1248 
Thr Ala Pro lie Arg Thr Arg Lys lie Ser Arg Asp Met Leu Leu Phe 
405 410 415 

tgg aag cga tat gac aag cag atg gca gaa gag agg aaa aag caa gaa 1296 
Trp Lys Arg Tyr Asp Lys Gin Met Ala Glu Glu Arg Lys Lys Gin Glu 
420 425 430 

aag gaa get gca gag get ttt aaa cgt gaa cag gag cag cga gag tea 1344 
Lys Glu Ala Ala Glu Ala Phe Lys Arg Glu Gin Glu Gin Arg Glu Ser 
435 440 445 



aaa agg cag caa caa agg etc aat ttc ctt att aaa cag act gag ctt 1392 
Lys Arg Gin Gin Gin Arg Leu Asn Phe Leu He Lys Gin Thr Glu Leu 
450 455 460 



tac agt cac ttc atg caa aac aag ace gat teg aat cct tec gaa gee 1440 
Tyr ser His Phe Met Gin Asn Lys Thr Asp Ser Asn Pro Ser Glu Ala 
465 470 475 480 



tta cca ata ggt gat gaa aat ccg att gac gaa gtg etc cca gaa act 
Leu Pro lie Gly Asp Glu Asn Pro lie Asp Glu val Leu Pro Glu Thr 
485 490 495 



ggc ctt cag tgg eta gtc aat tgt tat gag cag ggt ttg aat ggc ata 
Gly Leu Gin Trp Leu val Asn cys Tyr Glu Gin Gly Leu Asn Gly lie 
595 600 605 



1488 



tea gcg gca gaa cct tct gag gta gag gat cct gaa gag get gaa ctg 1536 
Ser Ala Ala Glu Pro Ser Glu val Glu Asp Pro Glu Glu Ala Glu Leu 
500 505 510 

aag gaa aag gtc ttg aga get gee caa gat gcg gtg tct aag cag aag 1584 
Lys Glu Lys Val Leu Arg Ala Ala Gin Asp Ala val ser Lys Gin Lys 
515 520 525 

caa ata aca gat gca ttt gac act gaa tat atg aag eta cgc caa act 1632 
Gin lie Thr Asp Ala Phe Asp Thr Glu Tyr Met Lys Leu Arg Gin Thr 
530 535 540 

tct gaa atg gaa ggt cct tta aat gat ata tea gtt tct ggc teg age 1680 
Ser Glu Met Glu Gly Pro Leu Asn Asp lie ser val ser Gly Ser ser 
545 550 555 560 

aat ata gat ttg cat aac cca tct aca atg cct gtt aca tea aca gtt 1728 
Asn lie Asp Leu His Asn Pro Ser Thr Met Pro Val Thr Ser Thr Val 
565 570 575 

cag act cca gag tta ttt aaa gga acc ctt aaa gaa tac caa atg aaa 1776 
Gin Thr Pro Glu Leu Phe Lys Gly Thr Leu Lys Glu Tyr Gin Met Lys 
580 585 590 



1824 
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ctt get gat gaa atg ggc 
Leu Ala Asp Glu Met Giy 
610 



ttg gca cat ttg 
Leu Ala His Leu 
625 

gtt gec cct gec 
val Ala Pro Ala 



ttc tgt cct gac 
Phe Cys Pro Asp 
660 

cga aca att tta 
Arg Thr lie Leu 
675 

gat get ggc ttt 
Asp Ala Giy Phe 
690 

gat gaa aag tat 
Asp Glu Lys Tyr 
705 

gag gec caa gca 
Glu Ala Gin Ala 



get gag 
Ala Glu 
630 

tct gtt 
Ser val 
645 

ttg aaa 
Leu Lys 



ctt agt ttt aac 
Leu Ser Phe Asn 
740 

cag aac aac atg 
Gin Asn Asn Met 
755 

atg ttg ttt gac 
Met Leu Phe Asp 
770 

att gag aat cat 
He Glu Asn His 
785 

aac aga ctg cat 
Asn Arg Leu His 



aag gat gtg gtt 
Lys Asp val val 
820 

tgc aag etc agt 
cys Lys Leu ser 
835 

aaa att tct ctg 
Lys lie Ser Leu 
850 

gat aag aaa gta 
Asp Lys Lys val 



ttg ggt aag act att caa get atg gcg ttc 
Leu Giy Lys Thr lie Gin Ala Met Ala Phe 
615 620 

gaa aag aac att tgg ggt cca ttt ctt gtt 
Glu Lys Asn He Trp Giy Pro Phe Leu Val 
635 640 

ctt aac aat tgg get gat gaa ate agt cgt 
Leu Asn Asn Trp Ala Asp Glu lie Ser Arg 
650 655 

act ctt cca tat tgg gga gga tta caa gaa 
Thr Leu Pro Tyr Trp Giy Giy Leu Gin Glu 
665 670 

aat ate aat ccc aag cgt atg tac cga agg 
Asn lie Asn Pro Lys Arg Met Tyr Arg Arg 
680 685 

ttg att act age tat cag eta tta gtc act 
Leu lie Thr Ser Tyr Gin Leu Leu Val Thr 
695 700 

egg gtg aag tgg caa tat atg gtg eta gat 
Arg val Lys Trp Gin Tyr Met vaT Leu Asp 
715 720 

agt tec tec agt ata aga tgg aaa ace ctt 
ser Ser ser Ser lie Arg Trp Lys Thr Leu 
730 735 

aac cga ttg ctt ctg act ggt act cca att 
Asn Arg Leu Leu Leu Thr Giy Thr Pro lie 
745 750 

tta tgg gec ctg ctg cat ttc ate atg cca 
Leu Trp Ala Leu Leu His Phe lie Met Pro 
76b 765 

gat caa ttt aat gaa tgg ttc tea aaa gga 
Asp Gin Phe Asn Glu Trp Phe Ser Lys Giy 
775 780 

cac gga ggc act tta aat gag cac cag ctt 
His Giy Giy Thr Leu Asn Glu His Gin Leu 
795 800 

ttg aaa ccg ttc atg ctt cga egg gta aaa 
Leu Lys Pro Phe Met Leu Arg Arg val Lys 
810 815 

eta act aca aag acg gaa gtt aca gta cac 
Leu Thr Thr Lys Thr Glu val Thr val His 
825 830 

caa caa get ttt tat cag get att aag aac 
Gin Gin Ala Phe Tyr Gin Ala lie Lys Asn 
840 845 

ttg ttt gat age aac cgc gga caa ttt act 
Leu Phe Asp Ser Asn Arg Giy Gin Phe Thr 
855 860 

ttg aat tta atg aat att gtc att caa eta agg aag 
Leu Asn Leu Met Asn He val lie Gin Leu Arg Lys 



aga aag 
Arg Lys 



cat att 
His lie 



ttt cgc 
Phe Arg 
710 

ate aag 
lie Lys 
725 

tgt egg 
Cys Arg 



gca gag 
Ala Glu 



aac cat 
Asn His 



get gaa 
Ala Glu 
790 

gcg ate 
Ala lie 
805 

tct gag 
ser Glu 



tct cga 
ser Arg 



get gag 
Ala Glu 



1872 



1920 



1968 



2016 



2064 



2112 



2160 



2208 



2256 



2304 



2352 



2400 



2448 



2496 



2544 



2592 



2640 
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865 870 875 880 

gtt tgc 
val Cys 



gtt tgc aac cat cca gag ttg ttc gaa agg aat gaa gag age teg tat 
Asn His Pro Glu Leu Phe Glu Arg Asn Glu Gly Ser Ser Tyr 



885 890 895 

etc tac ttt gga gtg act tec aat tct ctt ttg ccc cat ccc ttt ggt 

Leu Tyr Phe Gly val Thr ser Asn Ser Leu Leu Pro His Pro Phe Gly 

900 905 910 

gag eta gag gat gta cat tat tct ggt ggt caa aat ccg ata ata tac 

Glu Leu Glu Asp Val His Tyr ser Gly Gly Gin Asn Pro lie He Tyr 

915 920 925 



aag ata cct aag eta eta cac caa gag gtg etc caa aat tct gaa aca 
Lys lie Pro Lys Leu Leu His Gin Glu Val Leu Gin Asn Ser Glu Thr 
930 935 940 



2688 



2736 



2784 



etc caa aat tct gaa aca 2832 

2880 



3024 



3069 



ttt tgt tct tct gtc ggg cgt ggc ate tea aga gaa tct ttt ctg aag 
Phe cys ser ser Val Gly Arg Gly lie Ser Arg Glu Ser Phe Leu Lys 
945 950 955 960 

cat ttt aat ata tat tea cct gag tat att ctt aag tea ata ttc cca 2928 
His Phe Asn lie Tyr ser pro Glu Tyr He Leu Lys ser lie Phe Pro 
965 970 975 

tct gat agt ggg gta gat caa gtg gtt agt gga agt gga gca ttt ggc 2976 
ser Asp Ser Gly val Asp Gin val Val Ser Gly ser Gly Ala Phe Gly 
980 985 990 

ttt tea cgc ttg atg gat eta tea cca tea gaa gtt gga tat ctg get 
Phe ser Arg Leu Met Asp Leu Ser Pro Ser Glu val Gly Tyr Leu Ala 
995 1000 1005 

ctg tgt tct gtt gca gaa agg eta tta ttt tct ata ctg agg tgg 
Leu cys Ser val Ala Glu Arg Leu Leu Phe Ser lie Leu Arg Trp 
1010 1015 1020 

gag egg caa ttt ttg gat gaa tta gtt aac tct ctt atg gag tec 3114 
Glu Arg Gin Phe Leu Asp Glu Leu Val Asn Ser Leu Met Glu Ser 
1025 1030 1035 

aag gat ggt gat ctt agt gac aat aac ate gag aga gtt aaa acc 
Lys Asp Gly Asp Leu ser Asp Asn Asn lie Glu Arg Val Lys Thr 
y 1040 1045 1050 

aaa get gtc aca aga atg ttg ctg atg cca tea aaa gtt gaa acg 
Lvs Ala val Thr Arg Met Leu Leu Met Pro ser Lys Val Glu Thr 
1055 ~ 1060 1065 

aat ttt cag aaa agg aga eta age aca ggg cct acc cgt cct tea 
Asn Phe Gin Lys Arg Arg Leu Ser Thr Gly Pro Thr Arg Pro ser 
1070 1075 1080 

ttt gaa gcg eta gtg ate tct cat cag gat agg ttt ctt tea agt 
Phe Glu Ala Leu Val lie ser His Gin Asp Arg Phe Leu Ser ser 
1085 1090 1095 

ate aaa etc ctg cat tct gca tat act tat ate cca aaa gee aga 
lie Lys Leu Leu His Ser Ala Tyr Thr Tyr lie Pro Lys Ala Arg 
1100 1105 1110 

get cca cct gta age att cat tgc teg gac aga aat teg gca tac 
Ala Pro Pro Val Ser lie His Cys ser Asp Arg Asn Ser Ala Tyr 
1115 U20 H25 



3159 



3204 



3249 



3294 



3339 



3384 
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aga gtt aca gaa gaa tta cat caa cca tgg ctt aag aga eta tta 3429 
Arg Val Thr Glu Glu Leu His Gin Pro Trp Leu Lys Arg Leu Leu 
1130 1135 1140 



ate ggt ttt gca cga acg tea gaa get aat gga ccc agg aag cct 
He Gly Phe Ala Arg Thr Ser Glu Ala Asn Gly Pro Arg Lys Pro 
1145 1150 1155 



3474 



aac age ttt cca cat cct tta ate caa gaa att gat tea gaa ctt 3519 
Asn ser Phe Pro His Pro Leu He Gin Glu lie Asp ser Glu Leu 
1160 1165 1170 

cca gtt gtg cag cct gcg ctt caa ctg aca cac aga ata ttt ggt 3564 
pro val Val Gin Pro Ala Leu Gin Leu Thr His Arg lie Phe Gly 
1175 1180 1185 

tct tgc cct cca atg caa agt ttt gac cca gca aag ttg etc acg 3609 
Ser cys Pro Pro Met Gin Ser Phe Asp Pro Ala Lys Leu Leu Thr 
1190 1195 1200 

gac tct ggg aag ctg cag aca ctt gat ata tta ttg aag egg ctt 3654 
Asp Ser Gly Lys Leu Gin Thr Leu Asp lie Leu Leu Lys Arg Leu 
1205 1210 1215 

cga get gqa aat cac agg gtg etc ctg ttt gca caa atg aca aag 3699 
Arg Ala Gly Asn His Arg Val Leu Leu Phe Ala Gin Met Thr Lys 
1220 1225 1230 

atg ctg aac att etc gag gat tat atg aac tat aga aag tac aag 3744 
Met Leu Asn lie Leu Glu Asp Tyr Met Asn Tyr Arg Lys Tyr Lys 
1235 1240 1245 

tac etc agg ctt gat gqa tec tec ace ate atg gat cgc cga gat 3789 
Tyr Leu Arg Leu Asp Gly Ser Ser Thr lie Met Asp Arg Arg Asp 
1250 1255 1260 

atg gtt agg gat ttt cag cat agg age gat att ttt gta ttc ttg 3834 
Met Val Arg Asp Phe Gin His Arg ser Asp lie Phe Val Phe Leu 
1265 1270 1275 

ctg age acc aga get gqa gqa ctt gqt ate aac ttg acg get gca 3879 
Leu ser Thr Arg Ala Gly Gly Leu Gly lie Asn Leu Thr Ala Ala 
1280 1285 1290 

gac act gtc att ttc tat gaa agt gat tgg aat ccc acc ttg gat 3924 
Asp Thr Val lie Phe Tyr Glu Ser Asp Trp Asn Pro Thr Leu Asp 
1295 1300 1305 

tta caa get atg gac agg get cat cgt ctt gqa cag aca aaa gat 3969 
Leu Gin Ala Met Asp Arg Ala His Arg Leu Gly Gin Thr Lys Asp 
1310 1315 1320 

gtt act gtt tat cgt etc ate tgt aag gag acg gtg gaa gag aaa 4014 
Val Thr Val Tyr Arg Leu lie Cys Lys Glu Thr Val Glu Glu Lys 
1325 1330 1335 

att ttg cac agg gca agt cag aaa aat aca gtt caa cag ctt gtt 4059 
lie Leu His Arg Ala Ser Gin Lys Asn Thr Val Gin Gin Leu Val 
1340 1345 1350 

atg act gga ggg cat gtt cag ggt gat gat ttt ctt gga get gcg 4104 
Met Thr Gly Gly His Val Gin Gly Asp Asp Phe Leu Gly Ala Ala 
1355 1360 1365 



gat gtg gta tct ctg eta atg gat gat gcg gag gca gca caa ctg 
Asp Val val ser Leu Leu Met Asp Asp Ala Glu Ala Ala Gin Leu 



4149 
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1370 1375 1380 

gag cag aaa ttc aga gaa eta cca tta cag gta aag gac agg cag 4194 
Glu Gin Lys Phe Arg Glu Leu Pro Leu Gin val Lys Asp Arg Gin 
1385 1390 1395 

aag aaa aag acg aaa cgt ate aga ata gat get gaa gga gat gca 4239 
Lys Lys Lys Thr Lys Arg lie Arg lie Asp Ala Glu Gly Asp Ala 
1400 1405 1410 

act ttg gaa gag tta gaa gat gtt gac cga cag gat aac gga cag 4284 
Thr Leu Glu Glu Leu Glu Asp Val Asp Arg Gin Asp Asn Gly Gin 
1415 1420 1425 

gaa cct ttg gaa gaa ccg gaa aag cca aaa tec agt aat aaa aag 4329 
Glu Pro Leu Glu Glu Pro Glu Lys Pro Lys Ser Ser Asn Lys Lys 
1430 1435 1440 

agg aga get get tea aat ccg aaa get aga get cct cag aaa gca 4374 
Arg Arg Ala Ala Ser Asn pro Lys Ala Arg Ala Pro Gin Lys Ala 
y 1445 1450 1455 

aag gaa gaa gca aat ggt gaa gat act cct cag agg aca aaa agg 4419 
Lys Glu Glu Ala Asn Gly Glu Asp Thr Pro Gin Arg Thr Lys Arg 
1460 1465 1470 

gta aag aga caa aca aag age ata aac gaa agt ctt gaa cct gta 4464 
Val Lys Arg Gin Thr Lys ser He Asn Glu ser Leu Glu Pro Val 
1475 1480 1485 

ttc tct gee tct gta aca gaa tea aat aaa gga ttc gat cca agt 4509 
Phe Ser Ala Ser Val Thr Glu Ser Asn Lys Gly Phe Asp Pro Ser 
1490 1495 1500 



age tec get aac taa 
Ser Ser Ala Asn 
1505 



<210> 


2 


<211> 


1507 


<212> 


PRT 


<213> 


Arabidopsis thai i ana 


<400> 


2 



Met Asp Pro Ser Arg Arg Pro Pro Lys Asp Ser Pro Tyr Ala Asn Leu 
15 10 15 

Phe Asp Leu Glu Pro Leu Met Lys Phe Arg lie Pro Lys Pro Glu Asp 
20 25 30 

Glu Val Asp Tyr Tyr Gly ser ser Ser Gin Asp Glu Ser Arg Ser Thr 
35 40 45 

Gin Gly Gly val val Ala Asn Tyr ser Asn Gly ser Lys Ser Arg Met 
50 55 60 



4524 
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Asn Ala ser ser Lys Lys Arg Lys Arg Trp Thr Glu Ala Glu Asp Ala 
65 70 75 80 

Glu asp Asp Asp Asp Leu Tyr Asn Gin His val Thr Glu Glu His Tyr 
85 90 95 

Arg ser Met Leu Gly Glu His val Gin Lys Phe Lys Asn Arg ser Lys 
100 105 110 

Glu Thr Gin Gly Asn Pro Pro His Leu Met Gly Phe Pro Val Leu Lys 
115 120 125 



Ser Asn val Gly ser Tyr Arg Gly Arg Lys Pro Gly Asn Asp Tyr His 

' 135 



130 



140 



Gly Arg Phe Tyr Asp Met Asp Asn ser Pro Asn Phe Ala Ala Asp val 
145 150 155 160 



Thr Pro His Arg Arg Gly ser Tyr His Asp Arg Asp lie Thr Pro Lys 
165 170 175 



He Ala Tyr Glu Pro ser Tyr Leu Asp lie Gly Asp Gly Val lie Tyr 
180 185 190 

Lys lie Pro pro ser Tyr Asp Lys Leu val Ala Ser Leu Asn Leu Pro 
195 200 205 

Ser Phe ser Asp lie His Val Glu Glu Phe Tyr Leu Lys Gly Thr Leu 
210 215 220 

Asp Leu Arg ser Leu Ala Glu Leu Met Ala Ser Asp Lys Arg ser Gly 
225 230 235 240 

val Arg ser Arg Asn Gly Met Gly Glu Pro Arg Pro Gin Tyr Glu Ser 
245 250 255 



Leu Gin Ala Arg Met Lys Ala Leu Ser Pro ser Asn ser Thr Pro Asn 
260 265 270 



Phe ser Leu Lys Val Ser Glu Ala Ala Met Asn Ser Ala lie Pro Glu 
275 280 285 

Gly ser Ala Gly Ser Thr Ala Arg Thr lie Leu Ser Glu Gly Gly Val 

290 295 300 

Leu Gin Val His Tyr Val Lys lie Leu Glu Lys Gly Asp Thr Tyr Glu 
305 310 315 320 



lie val Lys Arg Ser Leu Pro Lys Lys Leu Lys Ala Lys Asn Asp Pro 
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325 330 335 

Ala val lie Glu Lys Thr Glu Arg Asp Lys lie Arg Lys Ala Trp lie 
340 345 350 

Asn lie val Arg Arg Asp lie Ala Lys His His Arg lie Phe Thr Thr 
355 360 365 

Phe His Arg Lys Leu Ser lie Asp Ala Lys Arg Phe Ala Asp Gly Cys 
370 375 380 

Gin Arg Glu val Arg Met Lys val Gly Arg ser Tyr Lys He Pro Arg 
385 390 395 400 

Thr Ala Pro lie Arg Thr Arg Lys He ser Arg Asp Met Leu Leu Phe 
405 410 415 

Trp Lys Arg Tyr Asp Lys Gin Met Al| Glu Glu Arg Lys Lys Gin Glu 

Lys Glu Ala Ala Glu Ala Phe Lys Arg Glu Gin Glu Gin Arg Glu ser 
435 440 445 

Lys Arg Gin Gin Gin Arg Leu Asn Phe Leu lie Lys Gin Thr Glu Leu 
450 455 460 

Tvr Ser His Phe Met Gin Asn Lys Thr Asp ser Asn Pro ser Glu Ala 
465 470 475 480 

Leu Pro lie Gly Asp Glu Asn Pro lie Asp Glu val Leu Pro Glu Thr 
485 490 495 

Ser Ala Ala Glu Pro ser Glu val Glu Asp pro Glu Glu Ala Glu Leu 
500 505 510 

Lys Glu Lys val Leu Arg Ala Ala Gin Asp Ala Val Ser Lys Gin Lys 
515 520 525 

Gin lie Thr Asp Ala Phe Asp Thr Glu Tyr Met Lys Leu Arg Gin Thr 
530 535 540 

ser Glu Met Glu Gly Pro Leu Asn Asp He Ser Val ser Gly ser ser 
545 550 555 560 

Asn He Asp Leu His Asn Pro Ser Thr Met Pro Val Thr ser Thr Val 
565 570 575 

Gin Thr Pro Glu Leu Phe Lys Gly Thr Leu Lys Glu Tyr Gin Met Lys 
580 585 590 
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Gly Leu Gin Trp Leu Val Asn cys Tyr Glu Gin Gly Leu Asn Gly lie 
595 600 605 

Leu Ala Asp Glu Met Gly Leu Gly Lys Thr lie Gin Ala Met Ala Phe 
610 615 620 

Leu Ala His Leu Ala Glu Glu Lys Asn lie Trp Gly Pro Phe Leu val 
625 630 635 640 

Val Ala Pro Ala Ser Val Leu Asn Asn Trp Ala Asp Glu lie Ser Arg 
645 650 655 

Phe Cys Pro Asp Leu Lys Thr Leu Pro Tyr Trp Gly Gly Leu Gin Glu 
660 665 670 

Arg Thr lie Leu Arg Lys Asn lie Asn Pro Lys Arg Met Tyr Arg Arg 
675 680 685 

Asp Ala Gly Phe His lie Leu lie Thr Ser Tyr Gin Leu Leu Val Thr 
690 695 700 

Asp Glu Lys Tyr Phe Arg Arg Val Lys Trp Gin Tyr Met val Leu Asp 
705 710 715 720 

Glu Ala Gin Ala He Lys ser Ser Ser Ser lie Arg Trp Lys Thr Leu 
725 730 735 

Leu Ser Phe Asn cys Arg Asn Arg Leu Leu Leu Thr Gly Thr pro lie 
740 745 750 

Gin Asn Asn Met Ala Glu Leu Trp Ala Leu Leu His Phe lie Met Pro 
755 760 765 

Met Leu Phe Asp Asn His Asp Gin Phe Asn Glu Trp Phe Ser Lys Gly 
770 775 780 

lie Glu Asn His Ala Glu His Gly Gly Thr Leu Asn Glu His Gin Leu 
785 790 795 800 

Asn Arg Leu His Ala lie Leu Lys Pro Phe Met Leu Arg Arg val Lys 
805 810 815 

Lys Asp val Val Ser Glu Leu Thr Thr Lys Thr Glu val Thr Val His 
820 825 830 

cys Lys Leu ser ser Arg Gin Gin Ala Phe Tyr Gin Ala lie Lys Asn 
835 840 845 



Lys He ser Leu Ala Glu Leu Phe Asp ser Asn Arg Gly Gin Phe Thr 
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850 



855 



860 



Asp Lys Lys val Leu Asn Leu Met Asn lie val lie Gin Leu Arg Lys 
865 870 875 880 



val cys Asn His Pro Glu Leu Phe Glu Arg Asn Glu Gly ser ser Tyr 
885 890 895 



Leu Tyr Phe Gly val Thr ser Asn Ser Leu Leu Pro His Pro phe Gly 
900 905 910 



Glu Leu Glu Asp val His Tyr Ser Gly Gly Gin Asn Pro lie lie Tyr 
915 920 925 



Lys lie Pro Lys Leu Leu His Gin Glu Val Leu Gin Asn ser Glu Thr 
930 935 940 



Phe cys ser ser val Gly Arg Gly lie Ser Arg Glu ser Phe Leu Lys 
945 950 955 960 



His Phe Asn lie Tyr Ser Pro Glu Tyr lie Leu Lys ser He Phe Pro 
965 970 975 



ser Asp Ser Gly val Asp Gin val val Ser Gly ser Gly Ala phe Gly 
980 985 990 



Phe Ser Arg Leu Met Asp Leu Ser Pro ser Glu val Gly Tyr Leu Ala 
995 1000 1005 



Leu Cys Ser val Ala Glu Arg Leu Leu Phe Ser lie Leu Arg Trp 
1010 1015 1020 



Glu Arg Gin Phe Leu Asp Glu Leu Val Asn Ser Leu Met Glu ser 
1025 1030 1035 



Lys Asp Gly Asp Leu ser Asp Asn Asn lie Glu Arg val Lys Thr 
1040 1045 1050 



Lys Ala val Thr Arg Met Leu Leu Met Pro Ser Lys Val Glu Thr 
1055 1060 1065 



Asn Phe Gin Lys Arg Arg Leu ser Thr Gly Pro Thr Arg Pro Ser 
1070 1075 1080 



Phe Glu Ala Leu val He ser His Gin Asp Arg Phe Leu Ser ser 
1085 1090 1095 



lie Lys Leu Leu His Ser Ala Tyr Thr Tyr lie Pro Lys Ala Arg 



1100 



1105 



1110 
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Ala Pro Pro Val Ser lie His cys ser Asp Arg Asn ser Ala Tyr 
1115 1120 1125 

Arg val Thr Glu Glu Leu His Gin Pro Trp Leu Lys Arg Leu Leu 
y 1130 1135 1140 

lie Gly Phe Ala Arg Thr ser Glu Ala Asn Gly Pro Arg Lys Pro 
1145 1150 1155 

Asn ser Phe Pro His Pro Leu lie Gin Glu lie Asp ser Glu Leu 
1160 U65 1170 



Pro val Val Gin Pro Ala Leu Gin Leu Thr His Arg lie Phe Gly 
1175 1180 U85 



Ser Cys Pro Pro Met Gin ser Phe Asp Pro Ala Lys Leu Leu Thr 
1190 1195 1200 

Asp Ser Gly Lys Leu Gin Thr Leu Asp lie Leu Leu Lys Arg Leu 
K 1205 1210 1215 

Arg Ala Gly Asn His Arg val Leu Leu Phe Ala Gin Met Thr Lys 
y 1220 1225 1230 



Met Leu Asn lie Leu Glu Asp Tyr Met Asn Tyr Arg Lys Tyr Lys 
1235 1240 1245 



Tyr Leu Arg Leu Asp Gly Ser ser Thr lie Met Asp Arg Arg Asp 

1250 1255 1260 

Met val Arg Asp Phe Gin His Arg ser Asp lie Phe val Phe Leu 

1265 1270 1275 



Leu ser Thr Arg Ala Gly Gly Leu Gly He Asn Leu Thr Ala Ala 
1280 ~ 1285 1290 



Asp Thr val lie Phe Tyr Glu ser Asp Trp Asn Pro Thr Leu Asp 
1295 1300 1305 

Leu Gin Ala Met Asp Arg Ala His Arg Leu Gly Gin Thr Lys Asp 
1310 1315 1320 

val Thr Val Tyr Arg Leu lie Cys Lys Glu Thr val Glu Glu Lys 
1325 1330 1335 

He Leu His Arg Ala Ser Gin Lys Asn Thr val Gin Gin Leu val 
1340 1345 1350 



Met Thr Gly Gly His Val Gin Gly Asp Asp Phe Leu Gly Ala Ala 
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1355 1360 1365 

Asp val val ser Leu Leu Met Asp Asp Ala Glu Ala Ala Gin Leu 
1370 1375 1380 

Glu Gin Lys Phe Arg Glu Leu Pro Leu Gin val Lys Asp Arg Gin 
1385 ~ 1390 1395 

Lys Lys Lys Thr Lys Arg lie Arg lie Asp Ala Glu Gly Asp Ala 
1400 1405 1410 

Thr Leu Glu Glu Leu Glu Asp Val Asp Arg Gin Asp Asn Gly Gin 
1415 1420 1425 

Glu Pro Leu Glu Glu Pro Glu Lys Pro Lys Ser Ser Asn Lys Lys 
1430 1435 1440 

Arg Arg Ala Ala Ser Asn Pro Lys Ala Arg Ala Pro Gin Lys Ala 
1445 1450 1455 

Lys Glu Glu Ala Asn Gly Glu Asp Thr Pro Gin Arg Thr Lys Arg 
1460 1465 1470 

val Lys Arg Gin Thr Lys Ser lie Asn Glu Ser Leu Glu Pro Val 
1475 1480 1485 

Phe Ser Ala Ser val Thr Glu ser Asn Lys Gly Phe Asp Pro Ser 
1490 1495 1500 

Ser Ser Ala Asn 
1505 

<210> 3 

<211> 4492 

<212> DNA 

<213> Arabidopsis thai i ana 



<400> 3 

atggatcctt caagacgacc accgaaggac tctccttacg cgaatctatt cgatctcgag 60 

ccgttgatga agtttagaat tccgaaacct gaagatgaag ttgattatta tgggagtagt 120 

agccaggatg aaagtagaag cactcaaggt ggggtagtgg caaactacag caatgggtct 180 

aaatcgagaa tgaatgcgag ctccaagaag agaaagcggt ggacagaagc tgaggatgca 240 

gaggacgatg atgatctcta caatcaacat gttactgagg agcactaccg atcaatgctt 300 

ggggagcatg tacaaaaatt caaaaatagg tccaaggaga ctcaagggaa tcctcctcat 360 
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ctgatgggtt ttccggtgct 


aaagagcaat 


gtgggcagtt 


acagaggtag 


gaaaccaggg 


420 


aatgattacc atgggaggtt 


ctatgacatg 


gacaactctc 


caaattttgc 


agctgatgtg 


480 


accccacata ggcgaggaag 


ctaccatgat 


cgtgatatta 


cacccaagat 


agcatatgaa 


540 


ccttcgtatt tggacattgg 


tgatggtgtc 


atctacaaaa 


tccccccaag 


ttatgacaag 


600 


ctggtggcat cattaaactt 


accgagcttt 


tcagacattc 


atgtggaaga 


attttacttg 


660 


aaaggaactc tggatctgag 


atcattagca 


gaactgatgg 


caagtgataa 


aaggtctgga 


720 


gtaagaagcc gtaatggaat 


gggtgagcct 


cgacctcaat 


atgaatctct 


tcaagctaga 


780 


atgaaggccc tgtcaccttc 


aaactccacc 


ccaaatttta 


gcctcaaggt 


gtcagaagct 


840 


gcaatgaatt ctgccattcc 


agaaggatct 


gctggaagta 


ctgcacggac 


aattctgtct 


900 


gagggtggtg ttttacaggt 


ccattacgtg 


aagattctgg 


agaaggggga 


tacatacgag 


960 


attgttaaac gaagtctacc 


gaagaagctg 


aaagcaaaga 


atgatcctgc 


agtcattgag 


1020 


aaaacagaaa gggataaaat 


tagaaaagcc 


tggatcaata 


ttgtcagaag 


agatatagca 


1080 


aaacaccata gaattttcac 


tacttttcat 


cgtaaactat 


caattgatgc 


caagaggttt 


1140 


gcagatggtt gccaaagaga 


ggtgagaatg 


aaggtgggta 


gatcatacaa 


aatcccaaga 


1200 


actgcaccaa ttcgcactag 


gaagatatcc 


agagacatgc 


tgctattctg 


gaagcgatat 


1260 


gacaagcaga tggcagaaga 


gaggaaaaag 


caagaaaagg 


aagctgcaga 


ggcttttaaa 


1320 


cgtgaacagg agcagcgaga 


gtcaaaaagg 


cagcaacaaa 


ggctcaattt 


ccttattaaa 


1380 


cagactgagc tttacagtca 


cttcatgcaa 


aacaagaccg 


attcgaatcc 


ttccgaagcc 


1440 


ttaccaatag gtgatgaaaa 


tccgattgac 


gaagtgctcc 


cagaaacttc 


agcggcagaa 


1500 


ccttctgagg tagaggatcc 


tgaagaggct 


gaactgaagg 


aaaaggtctt 


gagagctgcc 


1560 


caagatgcgg tgtctaagca 


gaagcaaata 


acagatgcat 


ttgacactga 


atatatgaag 


1620 


ctacgccaaa cttctgaaat 


ggaaggtcct 


ttaaatgata 


tatcagtttc 


tggctcgagc 


1680 


aatatagatt tgcataaccc 


atctacaatg 


cctgttacat 


caacagttca 


gactccagag 


1740 


ttatttaaag gaacccttaa 


agaataccaa 


atgaaaggcc 


ttcagtggct 


agtcaattgt 


1800 


tatgagcagg gtttgaatgg 


catacttgct 


gatgaaatgg 


gcttgggtaa 


gactattcaa 


1860 


gctatggcgt tcttggcaca 


tttggctgag 


gaaaagaaca 


tttggggtcc 


atttcttgtt 


1920 


gttgcccctg cctctgttct 


taacaattgg 


gctgatgaaa 


tcagtcgttt 


ctgtcctgac 


1980 


ttgaaaactc ttccatattg 


gggaggatta 


caagaacgaa 


caattttaag 


aaagaatatc 


2040 


aatcccaagc gtatgtaccg 


aagggatgct 


ggctttcata 


ttttgattac 


tagctatcag 


2100 


ctattagtca ctgatgaaaa 


gtattttcgc 


cgggtgaagt 


ggcaatatat 


ggtgctagat 


2160 


gaggcccaag caatcaagag 


ttcctccagt 


ataagatgga 


aaacccttct 


tagttttaac 


2220 


tgtcggaacc gattgcttct 


gactggtact 


ccaattcaga 


acaacatggc 


agagttatgg 


2280 


gccctgctgc atttcatcat 


gccaatgttg 


tttgacaacc 


atgatcaatt 


taatgaatgg 


2340 
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ttctcaaaag 


gaattgagaa 


tcatgctgaa 


cacggaggca 


ctttaaatga 


gcaccagctt 


2400 


aacagactgc 


atgcgatctt 


gaaaccgttc 


atgcttcgac 


gggtaaaaaa 


ggatgtggtt 


2460 


tctgagctaa 


ctacaaagac 


ggaagttaca 


gtacactgca 


agctcagttc 


tcgacaacaa 


2520 


gctttttatc 


aggctattaa 


gaacaaaatt 


tctctggctg 


agttgtttga 


tagcaaccgc 


2580 


ggacaattta 


ctgataagaa 


agtattgaat 


ttaatgaata 


ttgtcattca 


actaaggaag 


2640 


gtttgcaacc 


atccagagtt 


gttcgaaagg 


aatgaaggga 


gctcgtatct 


ctactttgga 


2700 


gtgacttcca 


attctctttt 


gccccatccc 


tttggtgagc 


tagaggatgt 


acattattct 


2760 


ggtggtcaaa 


atccgataat 


atacaagata 


cctaagctac 


tacaccaaga 


ggtgctccaa 


2820 


aattctgaaa 


cattttgttc 


ttctgtcggg 


cgtggcatct 


caagagaatc 


ttttctgaag 


2880 


cattttaata 


tatattcacc 


tgagtatatt 


cttaagtcaa 


tattcccatc 


tgatagtggg 


2940 


gtagatcaag 


tggttagtgg 


aagtggagca 


tttggctttt 


cacgcttgat 


ggatctatca 


3000 


ccatcagaag 


ttggatatct 


ggctctgtgt 


tctgttgcag 


aaaggctatt 


attttctata 


3060 


ctgaggtggg 


agcggcaatt 


tttggatgaa 


ttagttaact 


ctcttatgga 


gtccaaggat 


3120 


ggtgatctta 


gtgacaataa 


catcgagaga 


gttaaaacca 


aagctgtcac 


aagaatgttg 


3180 


ctgatgccat 


caaaagttga 


aacgaatttt 


cagaaaagga 


gactaagcac 


agggcctacc 


3240 


cgtccttcat 


ttgaagcgct 


agtgatctct 


catcaggata 


ggtttctttc 


aagtatcaaa 


3300 


ctcctgcatt 


ctgcatatac 


ttatatccca 


aaagccagag 


ctccacctgt 


aagcattcat 


3360 


tgctcggaca 


gaaattcggc 


atacagagtt 


acagaagaat 


tacatcaacc 


atggcttaag 


3420 


agactattaa 


tcggttttgc 


acgaacgtca 


gaagctaatg 


gacccaggaa 


gcctaacagc 


3480 


tttccacatc 


ctttaatcca 


agaaattgat 


tcagaacttc 


cagttgtgca 


gcctgcgctt 


3540 


caactgacac 


acagaatatt 


tggttcttgc 


cctccaatgc 


aaagttttga 


cccagcaaag 


3600 


ttgctcacgg 


actctgggaa 


gctgcagaca 


cttgatatat 


tattgaagcg 


gcttcgagct 


3660 


ggaaatcaca 


gggtgctcct 


gtttgcacaa 


atgacaaaga 


tgctgaacat 


tctcgaggat 


3720 


tatatgaact 


atagaaagta 


caagtacctc 


aggcttgatg 


gatcctccac 


catcatggat 


3780 


cgccgagata 


tggttaggga 


ttttcagcat 


aggagcgata 


tttttgtatt 


cttgctgagc 


3840 


accagagctg 


gaggacttgg 


tatcaacttg 


acggctgcag 


acactgtcat 


tttctatgaa 


3900 


agtgattgga 


atcccacctt 


ggatttacaa 


gctatggaca 


gggctcatcg 


tcttggacag 


3960 


acaaaagatg 


gagacggtgg 


aagagaaaat 


tttgcacagg 


gcaagtcaga 


aaaatacagt 


4020 


tcaacagctt 


gttatgactg 


gagggcatgt 


tcagggtgat 


gattttcttg 


gagctgcgga 


4080 


tgtggtatct 


ctgctaatgg 


atgatgcgga 


ggcagcacaa 


ctggagcaga 


aattcagaga 


4140 


actaccatta 


caggacaggc 


agaagaaaaa 


gacgaaacgt 


atcagaatag 


atgctgaagg 


4200 


agatgcaact 


ttggaagagt 


tagaagatgt 


tgaccgacag 


gataacggac 


aggaaccttt 


4260 


ggaagaaccg 


gaaaagccaa 


aatccagtaa 


taaaaagagg 


agagctgctt 


caaatccgaa 


4320 
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agctagagct cctcagaaag caaaggaaga agcaaatggt gaagatactc ctcagaggac 4380 
aaaaagggta aagagacaaa caaagagcat aaacgaaagt cttgaacctg tattctctgc 4440 
ctctgtaaca gaatcaaata aaggattcga tccaagtagc tccgctaact aa 4492 

<210> 4 

<211> 1377 

<212> DNA 

<213> Arabidopsis thai i ana 
<220> 

<221> CDS 

<222> (1).-(1377) 

<223> 



<400> 4 

atg gag aaa gta aag att gaa gaa att cag tec acc get aag aaa caa 48 
Met Glu Lys val Lys lie Glu Glu He Gin ser Thr Ala Lys Lys Gin 
15 10 15 

egg att get act cac acc cat ate aaa ggc ctt ggc etc gag cca act 96 
Arg lie Ala Thr His Thr His lie Lys Gly Leu Gly Leu Glu Pro Thr 
20 ' 25 30 

ggt ate cct ata aaa ttg gca get gga ttt gtt ggt caa ctt gag get 
Gly lie Pro lie Lys Leu Ala Ala Gly Phe Val Gly Gin Leu Glu Ala 
35 40 45 

aga gag gca get ggt ctt gta gtt gac atg att aag cag aag aaa atg 
Arg Glu Ala Ala Gly Leu val val Asp Met lie Lys Gin Lys Lys Met 
50 55 60 

gcg ggc aag get ctt ttg ctt get gga cct cct gga act ggg aaa aca 
Ala Gly Lys Ala Leu Leu Leu Ala Gly pro Pro Gly Thr Gly Lys Thr 
65 70 75 80 

get ttg get ctt gga ate tct caa gag ctg gga age aag gtt cca ttc 288 
Ala Leu Ala Leu Gly lie ser Gin Glu Leu Gly Ser Lys Val Pro Phe 
85 90 95 



144 



192 



240 



tgt cca atg gtt gga tct gag gtt tac tea tea gag gtt aag aaa aca 336 

Cys Pro Met val Gly ser Glu val Tyr ser Ser Glu val Lys Lys Thr 
100 105 110 

gag gtt etc atg gag aat ttt aga cgt gee att ggt eta cgt ate aag 384 

Glu val Leu Met Glu Asn phe Arg Arg Ala lie Gly Leu Arg lie Lys 
115 120 125 

gaa acc aaa gaa gtc tat gaa ggg gag gtc acc gag ctg tea cca gaa 432 

Glu Thr Lys Glu Val Tyr Glu Gly Glu val Thr Glu Leu ser Pro Glu 

130 135 140 

gaa act gaa age etc act gga ggt tat ggt aaa age ate age cat gtt 480 

Glu Thr Glu ser Leu Thr Gly Gly Tyr Gly Lys Ser He Ser His Val 
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145 150 155 160 

gta att aca etc aag aca gtc aaa gga acc aaa cat ctg aaa ttg gat 528 
Val lie Thr Leu Lys Thr val Lys Gly Thr Lys His Leu Lys Leu Asp 
165 170 175 

ccc act ate tat gat gec ttg att aag gaa aag gta get gta gga gat 576 
pro Thr lie Tyr Asp Ala Leu lie Lys Glu Lys val Ala val Gly Asp 
180 185 190 



gta ate tat ate gaa gca aac agt gga get gtc aaa egg gta ggt aga 
val lie Tyr lie Glu Ala Asn ser Gly Ala Val Lys Arg val Gly Arg 
195 200 205 



cca ctt ccc aaa gga gag gtc cac aaa aag aaa gag ata gtg cag gat 
pro Leu Pro Lys Gly Glu val His Lys Lys Lys Glu He val Gin Asp 
225 230 235 240 



gaa tta acc gtg gat gaa gaa tgc ttg gtt eta ctt ggg gag att ggg 

Glu Leu Thr Val Asp Glu Glu cys Leu val Leu Leu Gly Glu lie Gly 

385 390 395 400 

caa aga act tea eta agg cac get gtg cag ctt ctg tct cct gee age 

Gin Arg Thr Ser Leu Arg His Ala Val Gin Leu Leu Ser Pro Ala Ser 

405 410 415 



624 



agt gat get ttt gec act gaa ttt gat ctg gaa gca gaa gaa tat gtt 672 
Ser Asp Ala Phe Ala Thr Glu Phe Asp Leu Glu Ala Glu Glu Tyr val 
210 215 220 



720 



768 



816 



gtc aca etc caa gat ctg gat gca gca aat get cga cct caa ggt ggc 
Val Thr Leu Gin Asp Leu Asp Ala Ala Asn Ala Arg Pro Gin Gly Gly 
245 250 255 

cag gat ata ctt tct ttg atg ggc caa atg atg aaa ccg egg aag act 
Gin Asp lie Leu ser Leu Met Gly Gin Met Met Lys Pro Arg Lys Thr 
260 265 270 

gag ate act gat aag ctt egg caa gaa att aac aag gtt gtg aac cga 864 
Glu lie Thr Asp Lys Leu Arg Gin Glu lie Asn Lys Val Val Asn Arg 
275 280 285 

tat ata gat gaa ggt gtg gca gag ctt gtt cca gga gtt eta ttt att 912 
Tyr lie Asp Glu Gly val Ala Glu Leu Val Pro Gly Val Leu Phe He 
290 295 300 

gat gag gtt cat atg ctt gat atg gag tgc ttc tea tac ttg aac cgt 960 
Asp Glu Val His Met Leu Asp Met Glu cys Phe Ser Tyr Leu Asn Arg 
305 310 315 320 

get ctt gag age tea tta tct ccg ata gtg ata ttt gca aca aat aga 1008 
Ala Leu Glu ser Ser Leu ser Pro lie val lie Phe Ala Thr Asn Arg 
325 330 335 

ggt gtt tgc aac gta aga ggg act gat atg ccc age ccc cat gga gtc 1056 
Gly val cys Asn Val Arg Gly Thr Asp Met Pro Ser Pro His Gly Val 
340 345 350 

cct att gat eta tta gat cga ttg gtt ate ate egg act caa ate tat 1104 
pro lie Asp Leu Leu Asp Arg Leu Val lie lie Arg Thr Gin lie Tyr 
355 360 365 

gat ccc tct gaa atg ate cag att ata gee att cgt gcg caa gtt gaa 1152 
Asp Pro ser Glu Met lie Gin lie lie Ala lie Arg Ala Gin val Glu 
370 375 380 



1200 



1248 
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att gta gcg aaa atg aat ggc cgt gac aat att tgc aag get gat ata 1296 
lie Val Ala Lys Met Asn Gly Arg Asp Asn lie cys Lys Ala Asp lie 
420 . 425 430 

gag gaa gta aca tea etc tac ttg gat get aaa tct tea gca aag ctt 1344 
Glu Glu val Thr ser Leu Tyr Leu Asp Ala Lys ser ser Ala Lys Leu 
435 440 445 

ttg cat gag caa caa gaa aaa tac ate tea tga 1377 
Leu His Glu Gin Gin Glu Lys Tyr lie Ser 
450 455 

<210> 5 

<211> 458 

<212> PRT 

<213> Arabidopsis thai i ana 



<400> 5 

Met Glu Lys val Lys lie Glu Glu He Gin ser Thr Ala Lys Lys Gin 
1 5 10 15 

Arg lie Ala Thr His Thr His lie Lys Gly Leu Gly Leu Glu Pro Thr 
20 25 30 

Gly He Pro lie Lys Leu Ala Ala Gly Phe val Gly Gin Leu Glu Ala 
35 40 45 

Arg Glu Ala Ala Gly Leu val val Asp Met lie Lys Gin Lys Lys Met 
50 55 60 

Ala Gly Lys Ala Leu Leu Leu Ala Gly Pro Pro Gly Thr Gly Lys Thr 
65 70 75 80 

Ala Leu Ala Leu Gly lie Ser Gin Glu Leu Gly Ser Lys val Pro Phe 
85 90 95 

Cys Pro Met val Gly ser Glu Val Tyr ser ser Glu Val Lys Lys Thr 
100 105 110 

Glu Val Leu Met Glu Asn Phe Arg Arg Ala lie Gly Leu Arg lie Lys 
115 120 125* 

Glu Thr Lys Glu Val Tyr Glu Gly Glu Val Thr Glu Leu Ser Pro Glu 
130 135 140 

Glu Thr Glu Ser Leu Thr Gly Gly Tyr Gly Lys ser lie Ser His val 
145 150 155 160 

val lie Thr Leu Lys Thr Val Lys Gly Thr Lys His Leu Lys Leu Asp 
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165 



170 



175 



pro Thr He Tyr Asp Ala Leu He Lys Glu Lys Val Ala Val Gly Asp 
180 185 190 



val He Tyr He Glu Ala Asn Ser Gly Ala val Lys Arg val Gly Arg 
195 200 205 



Ser Asp Ala Phe Ala Thr Glu Phe Asp Leu Glu Ala Glu Glu Tyr val 
210 215 220 



pro Leu Pro Lys Gly Glu Val His Lys Lys Lys Glu lie Val Gin Asp 
225 230 235 240 



val Thr Leu Gin Asp Leu Asp Ala Ala Asn Ala Arg Pro Gin Gly Gly 
245 250 255 



Gin Asp He Leu ser Leu Met Gly Gin Met Met Lys Pro Arg Lys Thr 
260 265 270 



Glu lie Thr Asp Lys Leu Arg Gin Glu lie Asn Lys Val val Asn Arg 
275 280 285 



Tyr He Asp Glu Gly Val Ala Glu Leu Val Pro Gly Val Leu Phe lie 
290 295 300 



Asp Glu val His Met Leu Asp Met Glu Cys Phe ser Tyr Leu Asn Arg 
305 310 315 320 



Ala Leu Glu ser Ser Leu Ser pro lie val lie Phe Ala Thr Asn Arg 
325 330 335 



Gly val Cys Asn Val Arg Gly Thr Asp Met Pro Ser Pro His Gly val 
y 7 340 345 350 



Pro lie Asp Leu Leu Asp Arg Leu Val lie lie Arg Thr Gin lie Tyr 
355 360 365 



pro Ser Glu Met lie Gin lie He Ala lie Arg Ala Gin Val Glu 
370 375 380 



Glu Leu Thr val Asp Glu Glu Cys Leu Val Leu Leu Gly Glu He Gly 
385 390 395 400 



Gin Arg Thr Ser Leu Arg His Ala Val Gin Leu Leu Ser Pro Ala ser 
405 410 415 



He Val Ala Lys Met Asn Gly Arg Asp Asn lie Cys Lys Ala Asp lie 



420 




430 
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Glu Glu Val Thr Ser Leu Tyr Leu Asp Ala Lys ser ser Ala Lys Leu 
435 440 445 

Leu His Glu Gin Gin Glu Lys Tyr lie Ser 
450 455 

<210> 6 

<211> 1584 

<212> DNA 

<213> Arabidopsis thai i ana 



<220> 

<221> CDS 

<222> (1) . . C1410) 

<223> 



<400> 6 

atg gcg gaa eta aag eta tea gag agt egg gac tta ace aga gtc gag 
Met Ala Glu Leu Lys Leu Ser Glu ser Arg Asp Leu Thr Arg val Glu 
15 10 15 

cga ate ggc gca cac tea cac ate aga gga eta ggt etc gac tct gee 
Arg lie Gly Ala His Ser His lie Arg Gly Leu Gly Leu Asp Ser Ala 
20 25 30 

etc gag ccg cga get gtt tec gaa ggt atg gtc ggt caa gtg aag gcg 
Leu Glu Pro Arg Ala val ser Glu Gly Met val Gly Gin Val Lys Ala 
35 40 45 

cgt aaa gec gee ggt gta ate ctt cag atg att aga gaa ggg aaa ate 
Arg Lys Ala Ala Gly Val He Leu Gin Met lie Arg Glu Gly Lys lie 
50 55 60 

gcg ggt egg get att eta ata gcg ggt caa ccc gga acg ggt aag aca 
Ala Gly Arg Ala He Leu lie Ala Gly Gin Pro Gly Thr Gly Lys Thr 
65 70 75 80 

gcg att gca atg ggt atg gcg aaa tct ctt ggc ttg gaa act cct ttt 
Ala He Ala Met Gly Met Ala Lys Ser Leu Gly Leu Glu Thr Pro Phe 
85 90 95 

gcg atg att gca gga agt gaa att ttc tea tta gag atg tea aag aca 
Ala Met lie Ala Gly Ser Glu lie Phe ser Leu Glu Met Ser Lys Thr 
100 105 110 

gaa get ttg act cag tct ttt cgt aaa gcg att ggt gtt agg ate aaa 
Glu Ala Leu Thr Gin ser Phe Arg Lys Ala lie Gly Val Arg lie Lys 
115 120 125 

gaa gag aca gag gtt att gaa gga gaa gtt gtt gag gtt cag att gat 
Glu Glu Thr Glu val lie Glu Gly Glu Val Val Glu Val Gin lie Asp 
130 135 140 



48 



96 



144 



192 



240 



288 



336 



384 



432 
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agg cct get tct tct ggt gtt get tec aag tea gag aag atg act atg 

Arq Pro Ala Ser Ser Gly val Ala ser Lys ser Gly Lys Met Thr Met 

145 150 155 160 

aaa acg act gat atg gaa act gtg tat gat atg gga get aag atg att 

Lys Thr Thr Asp Met Glu Thr Val Tyr Asp Met Gly Ala Lys Met lie 
165 170 175 

gag get ttg aac aag gag aaa gtg cag agt ggt gat gtt att gee att 

Glu Ala Leu Asn Lys Glu Lys Val Gin Ser Gly Asp val lie Ala lie 
180 185 190 

gat aaa get act ggg aag att act aag ctt gga aga teg ttt teg agg 

Asp Lys Ala Thr Gly Lys lie Thr Lys Leu Gly Arg ser Phe Ser Arg 
195 200 205 



tct cgt gat tat gat get atg ggt gcg cag acc aag ttt gtg cag tgc 
ser Arg Asp Tyr Asp Ala Met Gly Ala Gin Thr Lys Phe Val Gin Cys 
210 215 220 



cct gaa ggt gag ttg cag aag agg aaa gag gtt gta cat tgt gtc act 

pro Glu Gly Glu Leu Gin Lys Arg Lys Glu Val Val Hns cys Val Thr 

225 230 235 240 

cac gag att gat gtt ate aac age agg aca caa ggg ttt ctt gee 

His Glu lie Asp val lie Asn Ser Arg Thr Gin Gly Phe Leu Ala 



ctt 

Leu _ . - , 

245 250 255 



480 



528 



576 



624 



672 



720 



768 



816 



ctt ttc act ggc gat act gga gaa ate cga tea gaa gtc egg gaa caa 
Leu Phe Thr Gly Asp Thr Gly Glu lie Arg ser Glu val Arg Glu Gin 
260 265 270 

att gat aca aaa gta get gaa tgg aga gaa gaa gga aaa gca gag ata 
lie Asp Thr Lys val Ala Glu Trp Arg Glu Glu Gly Lys Ala Glu lie 
275 280 285 

gtt ccc gga gtt etc ttc att gat gaa gtc cac atg etc gac ate gaa 
Val Pro Gly Val Leu Phe lie Asp Glu Val His Met Leu Asp lie Glu 
290 295 300 

tgc ttc tea ttc ctt aac cga get eta gaa aac gaa atg tea cca ate 
Cys Phe ser Phe Leu Asn Arg Ala Leu Glu Asn Glu Met Ser Pro lie 
305 310 315 320 

ctt gtg gtg gca aca aac cga gga gtg acg aca ate cgt ggc aca aac 
Leu val val Ala Thr Asn Arg Gly Val Thr Thr lie Arg Gly Thr Asn 
325 330 335 

cag aaa tea cca cac ggg ate ccg att gat etc ctt gac cgt ctt etc 
Gin Lys ser Pro His Gly lie Pro He Asp Leu Leu Asp Arg Leu Leu 
340 345 350 

ate ate act acc caa cct tac aca gac gat gac ata agg aag ata tta 1104 
lie lie Thr Thr Gin Pro Tyr Thr Asp Asp Asp lie Arg Lys He Leu 
355 360 365 

gaa ate cgt tgc caa gag gaa gac gtt gag atg aac gaa gag gee aaa 1152 
Glu lie Arg Cys Gin Glu Glu Asp Val Glu Met Asn Glu Glu Ala Lys 
370 375 380 

cag ctt ttg aca ttg ate gga cgt gat aca tct eta agg tat gcg att 
Gin Leu Leu Thr Leu lie Gly Arg Asp Thr ser Leu Arg Tyr Ala He 
385 390 395 400 

cat ctt ata acc gca get gca ttg tea tgc cag aaa egg aaa ggg aaa 
His Leu lie Thr Ala Ala Ala Leu ser Cys Gin Lys Arg Lys Gly Lys 



864 



912 



960 



1008 



1056 



1200 



1248 
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405 



410 



415 



gtc gtg gag gtt gag gat att cag aga gtt tac aga ctg ttc ttg gat 1296 
Val val Glu Val Glu Asp He Gin Arg Val Tyr Arg Leu Phe Leu Asp 
420 425 430 

gtg agg aga teg atg cag tat ctt gtt gag tat cag agt cag tat atg 1344 
Val Arg Arg Ser Met Gin Tyr Leu Val Glu Tyr Gin Ser Gin Tyr Met 
435 440 445 

ttc agt gaa cca ate aaa aac gat gaa get get gca gaa gac gaa caa 1392 
Phe Ser Glu Pro He Lys Asn Asp Glu Ala Ala Ala Glu Asp Glu Gin 
450 455 460 

gat get atg cag ate tga ggatccacct ctgtttgcct tatttatcat 1440 
Asp Ala Met Gin lie 



gtttcgtggt gatatgtatg attaggatgt tgaactcgga tttatgtttt ttttttttta 1500 
agttgtgacg agatteggtt ctagaaaatg atttaaccaa gttcaataca gatcggtttg 1560 



<210> 7 

<211> 469 

<212> PRT 

<213> Arabidopsis thai i ana 



<400> 7 

Met Ala Glu Leu Lys Leu ser Glu ser Arg Asp Leu Thr Arg Val Glu 
15 10 15 



Arg lie Gly Ala His Ser His lie Arg Gly Leu Gly Leu Asp ser Ala 
20 25 30 



Leu Glu Pro Arg Ala Val ser Glu Gly Met Val Gly Gin Val Lys Ala 
35 " 40 45 



Arg Lys Ala Ala Gly Val lie Leu Gin Met lie Arg Glu Gly Lys lie 
50 55 60 



Ala Gly Arg Ala lie Leu. lie Ala Gly Gin Pro Gly Thr Gly Lys Thr 
65 70 75 80 



Ala lie Ala Met Gly Met Ala Lys ser Leu Gly Leu Glu Thr pro Phe 
85 90 95 



Ala Met lie Ala Gly ser Glu lie Phe ser Leu Glu Met ser Lys Thr 
100 105 110 



Glu Ala Leu Thr Gin ser Phe Arg Lys Ala He Gly Val Arg lie Lys 




gtaeaaaaca aaaaaaaaaa aaaa 



1584 
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115 



120 



125 



Glu Glu Thr Glu val lie Glu Gly Glu val val Glu val Gin lie Asp 
130 135 140 



Arg Pro Ala Ser Ser Gly val Ala Ser Lys Ser Gly Lys Met Thr Met 
145 150 155 160 



Lys Thr Thr Asp Met Glu Thr Val Tyr Asp Met Gly Ala Lys Met lie 
165 170 175 

Glu Ala Leu Asn Lys Glu Lys Val Gin Ser Gly Asp val lie Ala lie 
180 185 190 



Asp Lys Ala Thr Gly Lys lie Thr Lys Leu Gly Arg ser Phe ser Arg 
195 200 205 



ser Arg Asp Tyr Asp Ala Met Gly Ala Gin Thr Lys phe val Gin cys 
210 215 220 



Pro Glu Gly Glu Leu Gin Lys Arg Lys Glu Val Val His Cys Val Thr 
225 230 235 240 



Leu His Glu lie Asp Val lie Asn Ser Arg Thr Gin Gly Phe Leu Ala 
245 250 255 



Leu Phe Thr Gly Asp Thr Gly Glu lie Arg ser Glu Val Arg Glu Gin 
260 265 270 



lie Asp Thr Lys Val Ala Glu Trp Arg Glu Glu Gly Lys Ala Glu He 
275 280 285 



val Pro Gly val Leu Phe lie Asp Glu val His Met Leu Asp lie Glu 
290 295 300 



Cys Phe ser Phe Leu Asn Arg Ala Leu Glu Asn Glu Met Ser Pro lie 
305 310 315 320 



Leu Val val Ala Thr Asn Arg Gly val Thr Thr lie Arg Gly Thr Asn 
325 330 335 



Gin Lys Ser Pro His Gly lie Pro lie Asp Leu Leu Asp Arg Leu Leu 
340 345 350 



lie lie Thr Thr Gin Pro Tyr Thr Asp Asp Asp lie Arg Lys He Leu 
355 360 365 



Glu He Arg Cys Gin Glu Glu Asp Val Glu Met Asn Glu Glu Ala Lys 



370 



375 



380 
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Gin Leu Leu Thr Leu lie Gly Arg Asp Thr ser Leu Arg Tyr Ala lie 
385 390 395 400 

His Leu lie Thr Ala Ala Ala Leu Ser cys Gin Lys Arg Lys Gly Lys 
405 410 415 

val val Glu val Glu Asp lie Gin Arg val Tyr Arg Leu Phe Leu Asp 
420 425 430 

Val Arg Arg ser Met Gin Tyr Leu Val Glu Tyr Gin Ser Gin Tyr Met 
435 440 445 

Phe Ser Glu Pro lie Lys Asn Asp Glu Ala Ala Ala Glu Asp Glu Gin 
450 455 460 

Asp Ala Met Gin lie 
465 

<210> 8 
<211> 1422 
<212> DNA 

<213> Arabidopsis thai i ana 



<220> 

<221> CDS 

<222> (64).. (1389) 

<223> 



<400> 8 

atggcagaac taaggttatc agaaactcga gacttaacta ggatcgaaag aatcggagca 60 

cac tea cac ata cga ggt tta ggt etc gac tea gta etc gag cca cga 108 

ser His lie Arg Gly Leu Gly Leu Asp Ser val Leu Glu Pro Arg 

1 5 10 15 

gee gta tec gaa gga atg gtt ggt caa ate aaa gca cgt aaa gee gee 

Ala val ser Glu Gly Met Val Gly Gin He Lys Ala Arg Lys Ala Ala 
20 25 30 

gga gta ace etc gag ttg ate aga gac ggc aaa ate teg ggt egg get 

Gly val Thr Leu Glu Leu lie Arg Asp Gly Lys lie ser Gly Arg Ala 
35 40 45 

ata ctt ata gcg ggt caa ccc gga acg ggt aaa ate gca ata gca atg 

lie Leu He Ala Gly Gin Pro Gly Thr Gly Lys lie Ala lie Ala Met 
50 55 60 

ggt ata gca aaa tea ctt gga caa gaa aca cca ttc act atg att gca 

Gly lie Ala Lys ser Leu Gly Gin Glu Thr Pro Phe Thr Met lie Ala 



156 



204 



252 



300 
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65 70 75 

gga agt gag ate ttt tct tta gag atg tea aag act gaa get tta act 348 

11* " ^ ^ ' " - - - - 



Gly Ser Glu lie Phe Ser Leu Glu Met Ser Lys Thr Glu Ala Leu Thr 
80 85 90 95 

caa get ttt cgt aaa get att gat gtt agg ate aaa gaa gag act gac 
Gin Ala Phe Arg Lys Ala lie Gly Val Arg lie Lys Glu Glu Thr Asp 
100 105 110 



aaa gtg gca gag tgg aga gaa gaa ggg aaa get gaa ata gta cct ggt 
Lys val Ala Glu Trp Arg Glu Glu Gly Lys Ala Glu lie val Pro Gly 
260 265 270 



ttc ctg aat aga get etc gaa aac gat atg tea cca ate ctg gtc gtg 
Phe Leu Asn Arg Ala Leu Glu Asn Asp Met ser Pro lie Leu val val 
290 295 300 



396 



gtq ata gaa gga gaa gtt gtg acg att teg att gat aga cct get tct 444 
Val lie Glu Gly Glu val Val Thr lie ser He Asp Arg Pro Ala Ser 
115 120 125 

tct ggt gqt tct gtg aag aag act ggg aag ata aca atg aag acg act 492 
ser Gly Gly ser Val Lys Lys Thr Gly Lys He Thr Met Lys Thr Thr 
130 135 140 

gat atg gaa tct aat ttt gat ttg gga tgg aaa ttg att gag cca ttg 540 
Asp Met Glu Ser Asn Phe Asp Leu Gly Trp Lys Leu lie Glu Pro Leu 
145 150 155 

gat aag gag aaa gta cag agt ggt gat gtt att gtt ttg gat agg ttt 588 
Asp Lys Glu Lys val Gin Ser Gly Asp Val lie val Leu Asp Arg Phe 
160 165 170 175 

tgt ggg aag att act aag ctt gga aga tct ttt acg agg tct aga gat 636 
cys Gly Lys lie Thr Lys Leu Gly Arg Ser Phe Thr Arg Ser Arg Asp 
180 185 ~ 190 

ttt gat gtt atg ggt tea aag act aag ttt gtg cag tgc cct gaa ggt 684 
Phe Asp val Met Gly Ser Lys Thr Lys Phe val Gin cys Pro Glu Gly 
195 200 205 

gag ctt gag aag agg aag gag gtt ttg cat tct gtc aca ctt cat gag 732 
Glu Leu Glu Lys Arg Lys Glu Val Leu His ser Val Thr Leu His Glu 
210 215 220 

att gat gtt att aat age agg act caa ggg tat eta gee etc ttc aca 780 
lie Asp Val lie Asn ser Arg Thr Gin Gly Tyr Leu Ala Leu Phe Thr 
225 230 235 

ggt gat aca ggc gag att cgt tea gaa acc cga gag caa age gat act 828 
Gly Asp Thr Gly Glu lie Arg Ser Glu Thr Arg Glu Gin Ser Asp Thr 
240 245 250 255 



876 



gtt etc ttc att gat gaa gtc cat atg ctt gat ate gaa tgc ttc tct 924 
Val Leu Phe lie Asp Glu val His Met Leu Asp lie Glu cys Phe Ser 
275 280 285 



972 



get aca aac aga gga atg aca aca ate cga gga aca aac cag ata tea 1020 
Ala Thr Asn Arg Gly Met Thr Thr lie Arg Gly Thr Asn Gin He ser 
305 310 315 

gca cat ggg ate cca ate gat ttt ctt gac cgt ctt ctt att ate aca 1068 
Ala His Gly He Pro lie Asp Phe Leu Asp Arg Leu Leu lie lie Thr 
320 325 330 335 
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aca cag cct tac aca caa gac gag ate aga aat att tta gag ate cgt 1116 
Thr Gin Pro Tyr Thr Gin Asp Glu lie Arg Asn lie Leu Glu lie Arg 
340 345 350 



tgc caa gaa gag gat gtg gag atg aac gag gaa gcg aaa cag ctt ctg 1164 

Cys Gin Glu Glu Asp val Glu Met Asn Glu Glu Ala Lys Gin Leu Leu 
355 360 365 

act ttg ate gga tgt aat acc teg ctt agg tac gcg att cat eta ate 1212 

Thr Leu lie Gly Cys Asn Thr Ser Leu Arg Tyr Ala lie His Leu lie 

370 375 380 

aat gca get gee eta get tgc ctg aaa cgt aaa gqg aaa gtc gta gag 1260 

Asn Ala Ala Ala Leu Ala Cys Leu Lys Arg Lys Gly Lys val val Glu 

385 390 395 



att cag gac att gag aga gtt tat aga ttg ttt tta gac acc aag aga 1308 
lie Gin Asp lie Glu Arg val Tyr Arg Leu Phe Leu Asp Thr Lys Arg 
400 405 410 415 



teg atg cag tac ttg gtt gag cat gag age gag tac ttg ttt age gtg 1356 
Ser Met Gin Tyr Leu Val Glu His Glu ser Glu Tyr Leu Phe Ser Val 
420 425 430 

cct ata aaa aac aca cag gag get act gca gga gaagaaacag aacacgaggc 1409 
Pro lie Lys Asn Thr Gin Glu Ala Thr Ala Gly 
435 440 

catggaagtt tga 1422 

<210> 9 

<211> 442 

<212> PRT 

<213> Arabidopsis thai i ana 



<400> 9 

Ser His lie Arg Gly Leu Gly Leu Asp Ser Val Leu Glu Pro Arg Ala 
15 10 15 

Val ser Glu Gly Met val Gly Gin lie Lys Ala Arg Lys Ala Ala Gly 
20 25 30 

Val Thr Leu Glu Leu lie Arg Asp Gly Lys lie ser Gly Arg Ala lie 
35 40 45 

Leu He Ala Gly Gin Pro Gly Thr Gly Lys lie Ala lie Ala Met Gly 
50 55 60 

lie Ala Lys ser Leu Gly Gin Glu Thr Pro Phe Thr Met lie Ala Gly 
65 70 75 80 

Ser Glu lie Phe ser Leu Glu Met ser Lys Thr Glu Ala Leu Thr Gin 
85 90 95 
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Ala Phe Arg Lys Ala lie Gly Val Arg He Lys Glu Glu Thr Asp val 
100 105 110 

lie Glu Gly Glu Val Val Thr lie ser He Asp Arg Pro Ala Ser Ser 
115 120 125 

Gly Gly Ser Val Lys Lys Thr Gly Lys lie Thr Met Lys Thr Thr Asp 
130 135 140 

Met Glu Ser Asn Phe Asp Leu Gly Trp Lys Leu lie Glu Pro Leu Asp 
145 150 155 160 

Lys Glu Lys Val Gin ser Gly Asp val lie val Leu Asp Arg Phe cys 
165 170 175 

Gly Lys lie Thr Lys Leu Gly Arg Ser Phe Thr Arg ser Arg Asp Phe 
180 185 190 

Asp Val Met Gly Ser Lys Thr Lys Phe val Gin cys Pro Glu Gly Glu 
195 200 205 

Leu Glu Lys Arg Lys Glu Val Leu His Ser Val Thr Leu His Glu lie 
210 215 220 

Asp val lie Asn ser Arg Thr Gin Gly Tyr Leu Ala Leu Phe Thr Gly 
225 230 235 240 

Asp Thr Gly Glu lie Arg Ser Glu Thr Arg Glu Gin Ser Asp Thr Lys 
245 250 255 

Val Ala Glu Trp Arg Glu Glu Gly Lys Ala Glu lie val Pro Gly Val 
260 265 270 

Leu Phe lie Asp Glu Val His Met Leu Asp lie Glu cys Phe ser Phe 
275 280 285 

Leu Asn Arg Ala Leu Glu Asn Asp Met Ser Pro lie Leu val Val Ala 
290 295 300 

Thr Asn Arg Gly Met Thr Thr lie Arg Gly Thr Asn Gin He Ser Ala 
305 310 315 320 

His Gly lie Pro lie Asp Phe Leu Asp Arg Leu Leu lie lie Thr Thr 
325 330 335 

Gin Pro Tyr Thr Gin Asp Glu lie Arg Asn lie Leu Glu lie Arg cys 
340 345 350 

Gin Glu Glu Asp Val Glu Met Asn Glu Glu Ala Lys Gin Leu Leu Thr 
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355 360 365 

Leu lie Gly Cys Asn Thr Ser Leu Arg Tyr Ala lie His Leu He Asn 
370 375 380 

Ala Ala Ala Leu Ala Cys Leu Lys Arg Lys Gly Lys val val Glu He 
385 390 395 400 

Gin Asp lie Glu Arg val Tyr Arg Leu Phe Leu Asp Thr Lys Arg ser 
405 410 415 

Met Gin Tyr Leu Val Glu His Glu Ser Glu Tyr Leu Phe Ser Val Pro 
420 425 430 

lie Lys Asn Thr Gin Glu Ala Thr Ala Gly 
435 440 

<210> 10 

<211> 1591 

<212> DMA 

<213> Arabidopsis thai i ana 



<400> 10 
ccacgcgtcc 


gtaagaagat 


tttgccagtg 


cgggaagcgg 


cggagattga 


gagattagcg 


60 


acgatggagg 


aaagcaaaca 


gaactatgac 


ctgacgccac 


taatagcgcc 


taacctggac 


120 


agacacttgg 


tgtttcctat 


attcgagttc 


cttcaagagc 


gtcagcttta 


ccctgatgag 


180 


cagatcctga 


agtctaaaat 


ccagcttttg 


aaccagacga 


acatggttga 


ttacgccatg 


240 


gatattcaca 


agagtctcta 


ccacactgaa 


gacgctcctc 


aagaaatggt 


ggagagaaga 


300 


acagaggttg 


tcgctaggct 


caaatctttg 


gaggaggctg 


ctgcaccact 


cgtgtctttt 


360 


cttttgaacc 


ctaacgctgt 


gcaggagcta 


agagctgaca 


agcagtacaa 


tctccaaatg 


420 


ctcaaggaac 


gctaccagat 


tggtccagac 


cagattgagg 


ctttgtacca 


gtacgccaag 


480 


tttcagtttg 


aatgtggcaa 


ctattctggt 


gctgctgatt 


atctttacca 


gtacaggacc 


540 


ctgtgctcta 


accttgagag 


gagtttgagt 


gccttgtggg 


gaaagctcgc 


atctgaaata 


600 


ttgatgcaaa 


actgggatat 


tgctcttgaa 


gagcttaacc 


gtctcaaaga 


gattattgac 


660 


tcaaagagtt 


tttcatcgcc 


gttaaaccag 


gtgcagaaca 


ggatttggtt 


gatgcattgg 


720 


ggtctgtata 


tcttttttaa 


ccatgataat 


ggaaggacac 


agatcattga 


tctttttaac 


780 


caagacaagt 


atctgaatgc 


catccaaact 


agtgctccac 


acttgctgcg 


ctacttggca 


840 


actgctttca 


ttgtcaacaa 


aaggagaaga 


ccacaattga 


aagaattcat 


taaggtcatt 


900 


cagcaagagc 


actactccta 


caaagatcca 


attatcgagt 


tcctggcatg 


tgtgtttgtc 


960 
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aattatgact 


ttgatggggc 


tcaaaagaag 


atgaaagagt gtgaagaggt cattgtgaat 


1020 


gatccattcc 


ttggcaagcg 


agttgaggat 


ggaaactttt caactgtacc actgagagat 


1080 


gaatttcttg 


aaaatgcccg 


cctattcgtc 


tttgaaacct attgcaaaat tcatcaaagg 


1140 


attgacatgg 


gggtacttgc 


tgaaaaattg 


aatctgaact atgaggaggc cgagagatgg 


1200 


attgtgaacc 


taatccgcac 


ctcaaagctt 


gatgccaaga ttgattctga gtcaggaact 


1260 


gtaatcatgg 


agcctactca 


gcccaacgtg 


catgagcagt tgataaacca caccaaaggc 


1320 


ttatcaggac 


gaacatacaa 


gttagtgaat 


cagctcttgg aacacacaca ggcgcaagca 


1380 


actcgctagt 


caaaattttg 


ctgtggaagc 


ctttccttga taaaactcac cttcggttga 


1440 


ctggaattat 


tttctttttc 


ttgctctgag 


ttcacctttt actttgaaaa agattattat 


1500 


ggagttgttc 


tatttgaaat 


gttggatcca 


cagattggaa cattttccaa ccaaatcagc 


1560 


attttgtaaa 


aaaaaaaaaa 


aaaaaaaaaa 


a 


1591 



<210> 11 

<211> 1773 

<212> DNA 

<213> Arabidopsis thai i ana 
<220> 

<221> CDS 

<222> (1) . . (1773) 

<223> 

<400> 11 

atg gca gaa ctg eta ttt gag 
Met Ala Glu Leu Leu Phe Glu 
1 5 

gga gtc gat get get ttc age 
Gly val Asp Ala Ala Phe ser 
20 

tgt aaa aaa gat gga att gtt 
Cys Lys Lys Asp Gly lie val 



tec att ccg ttt gtc gac gga 
ser lie Pro Phe Val Asp Gly 
50 55 

act aac att ggt gga tat cat 
Thr Asn lie Gly Gly Tyr His 
65 70 

tea ctt aag tac cct ttt cat 
ser Leu Lys Tyr Pro Phe His 



act tat gga gtq cct gca gtt gca ttt 48 
Thr Tyr Gly Val Pro Ala Val Ala Phe 
10 15 

tac aaa tac aat caa eta cat gga att 96 
Tyr Lys Tyr Asn Gin Leu His Gly He 
25 30 

etc tgt cct gga ttc acg aca aca cac 144 

Leu Cys Pro Gly Phe Thr Thr Thr His 
40 45 

gaa cct ata tat aaa gga tec age cga 192 

Glu Pro He Tyr Lys Gly ser Ser Arg 
60 

gtc act gat tat tta aag cag ctt ctg 240 

val Thr Asp Tyr Leu Lys Gin Leu Leu 

¥ 75 80 

teg tct agg ttt aca tgg gag aag gee 288 
Ser Ser Arg Phe Thr Trp Glu Lys Ala 
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85 



90 



95 



gaa gat ttg aaa ttg gaa cac tgt tat ate gca cct gac tat get teg 
Glu Asp Leu Lys Leu Glu His Cys Tyr lie Ala Pro Asp Tyr Ala Ser 
100 105 110 

gaa att egg tta ttc cag gaa gga aga aaa gaa get gaa gag aaa aca 
Glu lie Arg Leu Phe Gin Glu Gly Arg Lys Glu Ala Glu Glu Lys Thr 
115 120 125 

agt tat tgg cag ctt cca tgg ata cct cct ccc acc gaa gtt cct cca 
Ser Tyr Trp Gin Leu Pro Trp lie Pro Pro Pro Thr Glu val Pro Pro 



tea gaa gaa gag att gca agg aag gca get ata aga gaa aaa caa ggt 
ser Glu Glu Glu lie Ala Arg Lys Ala Ala lie Arg Glu Lys Gin Gly 
145 150 155 160 

caa agg ctg cga gaa atg get gaa gca aag aga gtq tec aag att aat 
Gin Arg Leu Arg Glu Met Ala Glu Ala Lys Arg Val Ser Lys lie Asn 
165 170 175 

gac atg gag aat caa ctg att age ttg cgt ttc ctt ttg aag caa gtt 
Asp Met Glu Asn Gin Leu lie Ser Leu Arg Phe Leu Leu Lys Gin val 
180 185 190 

gac cag gtt gaa gag gat gat att cca acc ttt ttg tea gac acc ggt 
Asp Gin val Glu Glu Asp Asp lie Pro Thr Phe Leu Ser Asp Thr Gly 
195 200 205 

tac gcg tec agg caa gag eta gag tct act att acg aaa gtq aca cag 
Tyr Ala ser Arg Gin Glu Leu Glu ser Thr lie Thr Lys Val Thr Gin 
210 215 220 

teg ctt aga aaa gca agg ggt gag ccg aag aat gaa cca get gag tat 
Ser Leu Arg Lys Ala Arg Gly Glu Pro Lys Asn Glu Pro Ala Glu Tyr 
225 230 235 240 

gaa gaa aac cct gat tct ctt aat aat gaa aag tat cca ctt atg aat 
Glu Glu Asn Pro Asp Ser Leu Asn Asn Glu Lys Tyr Pro Leu Met Asn 
245 250 255 

gtc ccc gat gat att ctt act cct gag cag ctt aag gac aag aag agg 
Val Pro Asp Asp lie Leu Thr Pro Glu Gin Leu Lys Asp Lys Lys Arg 
260 265 270 

caa atg ttt ctt aaa aca aca gca gag ggc egg eta cga get aga caa 
Gin Met Phe Leu Lys Thr Thr Ala Glu Gly Arg Leu Arg Ala Arg Gin 
275 280 285 

aag cgt aat gag gag gaa etc gaa aaa gag aaa aga aat caa tta gag 
Lys Arg Asn Glu Glu Glu Leu Glu Lys Glu Lys Arg Asn Gin Leu Glu 
290 295 300 

gag gaa aga cgt cgt gag aac cca gag tct tac tta gag gag ttg caa 
Glu Glu Arg Arg Arg Glu Asn Pro Glu ser Tyr Leu Glu Glu Leu Gin 
305 310 315 320 

get cag tac aag gaa gtq ttg gag aga gtt gag cag aag aag cgt ctg 
Ala Gin Tyr Lys Glu val Leu Glu Arg val Glu Gin Lys Lys Arg Leu 
325 330 335 

aaa aca aac gqg tec agt aac gqg aat aac aag tct gga ggt att ggg 
Lys Thr Asn Gly ser ser Asn Gly Asn Asn Lys Ser Gly Gly lie Gly 



336 
384 
432 
480 
528 
576 
624 
672 
720 
768 
816 
864 
912 
960 





340 



345 



350 
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cga ggc gag cga etc agt get gca cag agg gag aga atg cgt ctg ctg 1104 
Arg Gly Glu Arg Leu ser Ala Ala Gin Arg Glu Arg Met Arg Leu Leu 
355 360 365 

acg aca gca gec ttt gat aga ggg aaa ggc gag gat acg ttt ggt tct 1152 
Thr Thr Ala Ala Phe Asp Arg Gly Lys Gly Glu Asp Thr Phe Gly ser 
370 375 380 

aga gat gaa gat tgg cag etc tac aaa ctt atg age aag gat aat gac 1200 
Arg Asp Glu Asp Trp Gin Leu Tyr Lys Leu Met Ser Lys Asp Asn Asp 
385 390 395 400 

gat gat gac gaa caa cct gat tea gac gag gca gag ttg get cgt tta 1248 
Asp Asp Asp Glu Gin Pro Asp Ser Asp Glu Ala Glu Leu Ala Arg Leu 
405 410 415 



tea tct aga ctt cag gaa att gat cca aca ttt gtg cag aaa gta gaa 1296 
Ser Ser Arg Leu Gin Glu lie Asp Pro Thr Phe Val Gin Lys Val Glu 
420 425 430 



gga gaa ttg agt cag aca tea ggg gag gtg cca cgc gta cgc cca tta 

Gly Glu Leu Ser Gin Thr Ser Gly Glu val Pro Arg Val Arg Pro Leu 
435 440 445 

aca gag gaa gac tac aag ata gtg att ggt ata gaa aga ttc cgt tgc 

Thr Glu Glu Asp Tyr Lys lie Val lie Gly lie Glu Arg Phe Arg Cys 
450 455 460 



aac ttc ttg ggg aat gee ttt act aag atg gat tac gac gag aaa ggt 
Asn Phe Leu Gly Asn Ala Phe Thr Lys Met Asp Tyr Asp Glu Lys Gly 
565 570 575 



1344 



1392 



cca gag ate ctg ttc cat cca aac ctt att ggt att gac caa gta ggc 1440 
Pro Glu lie Leu Phe His Pro Asn Leu lie Gly lie Asp Gin val Gly 
465 470 475 480 

tta gac gag atg get ggc aca tea ate aga agg eta ccg cac gac gag 1488 
Leu Asp Glu Met Ala Gly Thr Ser lie Arg Arg Leu Pro His Asp Glu 
485 490 495 

aaa gag tta gag gag agg eta acg age teg ata eta atg acg ggc ggg 1536 
Lys Glu Leu Glu Glu Arg Leu Thr Ser Ser lie Leu Met Thr Gly Gly 
500 505 510 

tgt age ctt ctt cca ggg atg aac gag egg ttg gaa tgt ggg att agg 1584 
Cys Ser Leu Leu Pro Gly Met Asn Glu Arg Leu Glu Cys Gly lie Arg 
515 520 525 

atg ata aga cct tgc gga tea ccc att aac gtg gtt aga get atg gat 1632 
Met lie Arg Pro cys Gly ser Pro lie Asn Val val Arg Ala Met Asp 
530 535 540 

cca gtg ctg gat gca tgg cga gga gca tct gca ttt get get aat ttg 1680 
Pro val Leu Asp Ala Trp Arg Gly Ala Ser Ala Phe Ala Ala Asn Leu 
545 550 555 560 



1728 



gaa gat tgg ctt aga aat tat caa att cga tac aac tat ttg tga 1773 
Glu Asp Trp Leu Arg Asn Tyr Gin lie Arg Tyr Asn Tyr Leu 
580 585 590 

<210> 12 
<211> 590 
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<212> PRT 

<213> Arabidopsis thai i ana 



<400> 12 

Met Ala Glu Leu Leu Phe Glu Thr Tyr Gly val Pro Ala Val Ala Phe 
1 5 10 15 

Gly val Asp Ala Ala Phe Ser Tyr Lys Tyr Asn Gin Leu His Gly He 
20 25 30 

cys Lys Lys Asp Gly lie Val Leu Cys Pro Gly Phe Thr Thr Thr His 
35 40 45 

ser lie Pro Phe Val Asp Gly Glu Pro lie Tyr Lys Gly Ser ser Arg 
50 55 60 

Thr Asn lie Gly Gly Tyr His val Thr Asp Tyr Leu Lys Gin Leu Leu 
65 70 75 80 

Ser Leu Lys Tyr Pro Phe His ser Ser Arg Phe Thr Trp Glu Lys Ala 
85 90 95 

Glu Asp Leu Lys Leu Glu His Cys Tyr lie Ala Pro Asp Tyr Ala Ser 
100 105 110 

Glu lie Arg Leu Phe Gin Glu Gly Arg Lys Glu Ala Glu Glu Lys Thr 
115 120 125 

Ser Tyr Trp Gin Leu Pro Trp lie Pro Pro Pro Thr Glu Val Pro Pro 
130 135 140 

ser Glu Glu Glu lie Ala Arg Lys Ala Ala lie Arg Glu Lys Gin Gly 
145 150 155 160 

Gin Arg Leu Arg Glu Met Ala Glu Ala Lys Arg Val ser Lys lie Asn 
165 170 175 

Asp Met Glu Asn Gin Leu lie ser Leu Arg Phe Leu Leu Lys Gin Val 
180 185 190 

Asp Gin Val Glu Glu Asp Asp He Pro Thr Phe Leu Ser Asp Thr Gly 
195 200 205 

Tyr Ala Ser Arg Gin Glu Leu Glu ser Thr lie Thr Lys val Thr Gin 
210 215 220 

Ser Leu Arg Lys Ala Arg Gly Glu Pro Lys Asn Glu Pro Ala Glu Tyr 
225 230 235 240 
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Glu Glu Asn Pro Asp Ser Leu Asn Asn Glu Lys Tyr Pro Leu Met Asn 
245 250 255 

Val Pro Asp Asp He Leu Thr Pro Glu Gin Leu Lys Asp Lys Lys Arg 
260 265 270 



Gin Met Phe Leu Lys Thr Thr Ala Glu Gly Arg Leu Arg Ala Arg Gin 
275 280 285 



Lys Arg Asn Glu Glu Glu Leu Glu Lys Glu Lys Arg Asn Gin Leu Glu 
290 295 300 



Glu Glu Arg Arg Arg Glu Asn Pro Glu Ser Tyr Leu Glu Glu Leu Gin 
305 310 315 320 



Ala Gin Tyr Lys Glu Val Leu Glu Arg Val Glu Gin Lys Lys Arg Leu 



325 



330 



33! 



Lys Thr Asn Gly Ser Ser Asn Gly Asn Asn Lys Ser Gly Gly lie Gly 
340 345 350 

Arg Gly Glu Arg Leu Ser Ala Ala Gin Arg Glu Arg Met Arg Leu Leu 
355 360 365 

Thr Thr Ala Ala Phe Asp Arg Gly Lys Gly Glu Asp Thr Phe Gly Ser 
370 375 380 



Arg Asp Glu Asp Trp Gin Leu Tyr Lys Leu Met Ser Lys Asp Asn Asp 
385 390 395 400 



Asp Asp Asp Glu Gin Pro Asp Ser Asp Glu Ala Glu Leu Ala Arg Leu 
405 410 415 



ser ser Arg Leu Gin Glu lie Asp Pro Thr Phe Val Gin Lys val Glu 
420 425 430 

Gly Glu Leu Ser Gin Thr ser Gly Glu val Pro Arg Val Arg Pro Leu 
435 440 445 

Thr Glu Glu Asp Tyr Lys lie Val lie Gly lie Glu Arg Phe Arg Cys 
450 455 * 460 

Pro Glu lie Leu Phe His Pro Asn Leu lie Gly lie Asp Gin Val Gly 
465 470 475 480 



Leu Asp Glu Met Ala Gly Thr Ser He Arg Arg Leu Pro His Asp Glu 
485 490 495 
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Lys Glu Leu Glu Glu Arg Leu Thr ser Ser lie Leu Met Thr Gly Gly 
500 " 505 510 

Cys ser Leu Leu Pro Gly Met Asn Glu Arg Leu Glu Cys Gly lie Arg 
515 520 525 

Met lie Arg Pro cys Gly Ser Pro lie Asn Val Val Arg Ala Met Asp 
530 535 540 

Pro Val Leu Asp Ala Trp Arg Gly Ala Ser Ala Phe Ala Ala Asn Leu 
545 550 555 560 

Asn Phe Leu Gly Asn Ala Phe Thr Lys Met Asp Tyr Asp Glu Lys Gly 
565 570 575 

Glu Asp Trp Leu Arg Asn Tyr Gin lie Arg Tyr Asn Tyr Leu 
580 585 590 

<210> 13 
<211> 1416 
<212> DNA 

<213> Arabidopsis thai i ana 



<220> 

<221> CDS 

<222> (1) . . (1416) 

<223> 

<400> 13 

atg ate ctg aag aaa gta tgg gga teg gtq tgg aac cga teg aat agt 48 
Met lie Leu Lys Lys Val Trp Gly ser Val Trp Asn Arg ser Asn ser 
1 5 10 15 

gqc aag gat ttg gtq aat cat cag aga gcg ate gat gtt cct cct ctg 96 
Gly Lys Asp Leu val Asn His Gin Arg Ala lie Asp val Pro Pro Leu 
20 25 30 

tta ttg tct tct tea teg tct ctt gqt gcg ttt gat cag eta ccg atg 144 
Leu Leu ser ser ser ser ser Leu Gly Ala Phe Asp Gin Leu Pro Met 
35 40 45 

gat att eta gtc cag ata ctg atg atg atg gag cca aaa gat get gtq 192 
Asp lie Leu val Gin lie Leu Met Met Met Glu Pro Lys Asp Ala val 
50 55 60 

aaa ttg gqc tta acg tgc aaa gee tgg aaa tgc gta get agt gqt aat 240 
Lys Leu Gly Leu Thr cys Lys Ala Trp Lys cys Val Ala ser Gly Asn 
65 70 75 80 

cgt etc tgg ata ttt tat etc cag tgt tct caa gag cca tgg gac tec 288 
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Arg Leu Trp lie Phe Tyr Leu Gin Cys Ser Gin Glu Pro Trp Asp Ser 
85 90 95 



att ttc ttc get gaa act agt ttg cgt tct ggt tat cct etc cga atg 336 
lie Phe Phe Ala Glu Thr Ser Leu Arg Ser Gly Tyr Pro Leu Arg Met 
100 105 110 



att tct agt caa tea gga gag ttg teg ttt atg cac att tat agt cag 384 
He Ser ser Gin ser Gly Glu Leu Ser Phe Met His lie Tyr Ser Gin 
115 120 125 

agg gca caa gtt cct ggt tct ate att att gat ggt ggt tct gga tat 432 
Arg Ala Gin val Pro Gly ser lie lie lie Asp Gly Gly ser Gly Tyr 
130 135 140 

tgt aag ttt gqt tgg age aag tat gcg tct cct tct gga cgt tct get 480 
Cys Lys Phe Gly Trp Ser Lys Tyr Ala ser Pro Ser Gly Arg ser Ala 
145 150 155 160 

act ttt ttg gaa ttt ggt aac att gag tea ccg att tat get aga ctt 528 
Thr Phe Leu Glu Phe Gly Asn lie Glu Ser Pro lie Tyr Ala Arg Leu 
165 170 175 

caa cag ttc ttt gca acc att ttc ace agg atg cag gta aag ccc tct 576 
Gin Gin Phe Phe Ala Thr lie Phe Thr Arg Met Gin val Lys Pro Ser 
180 185 190 

atg cag cca ata gtg gta tea eta cct etc tgc cat ttt gat gat act 624 
Met Gin Pro lie Val val Ser Leu Pro Leu cys His Phe Asp Asp Thr 
195 200 205 

gaa tea gee aag gca tea agg egg caa ctt aag act get att ttc aat 672 
Glu Ser Ala Lys Ala Ser Arg Arg Gin Leu Lys Thr Ala He Phe Asn 
210 215 220 



gtc ttg ttt gac atg aac gtc cct gca gtg tgt gca gtt aat cag get 720 
val Leu Phe Asp Met Asn Val Pro Ala val cys Ala Val Asn Gin Ala 
225 230 235 240 



gtg tta get eta tat gca gca egg egg aca tct gga att gtt gtc aac 768 
Val Leu Ala Leu Tyr Ala Ala Arg Arg Thr ser Gly lie val val Asn 
245 250 255 

att ggt ttc caa gtc ate acc att ctt ccg att tta cat ggt aag gtg 816 
lie Gly Phe Gin Val He Thr lie Leu Pro lie Leu His Gly Lys val 
260 265 270 

atg cgc cag gta ggt gta gaa gtc att ggt ttt gga gca ttg aaa etc 864 
Met Arg Gin val Gly Val Glu Val lie Gly Phe Gly Ala Leu Lys Leu 
275 280 285 

acg ggc ttc ctt aag gag aag atg caa gaa aac aac att tec ttt caa 912 
Thr Gly Phe Leu Lys Glu Lys Met Gin Glu Asn Asn lie ser Phe Gin 
290 295 300 

tea etc tac act gtt cgt act ctt aaa gag aaa ctg tgt tat gtg get 960 
ser Leu Tyr Thr Val Arg Thr Leu Lys Glu Lys Leu cys Tyr val Ala 
305 310 315 320 

ctt gat tat aaa get gaa ctt tea aaa gac aca caa get tea gtg gaa 1008 
Leu Asp Tyr Lys Ala Glu Leu ser Lys Asp Thr Gin Ala Ser Val Glu 
325 330 335 

gtt tea ggt gaa gga tgg ttt act tta tea aaa gag cgt ttc caa aca 
val Ser Gly Glu Gly Trp Phe Thr Leu Ser Lys Glu Arg Phe Gin Thr 
340 345 350 



1056 
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ggg gag ata tta ttc caa cca cgt etc get gga atg cgt gca atg agt 1104 
Gly Glu lie Leu Phe Gin Pro Arg Leu Ala Gly Met Arg Ala Met Ser 
355 360 365 



ctg cac cag gec gtc teg etc tgt atg gac cac tgt gat gca gca gga 

Leu His Gin Ala val ser Leu cys Met Asp His Cys Asp Ala Ala Gly 
370 375 380 

ctt aca ggt gac gat agt tgg ttc aag act gta gta eta act gqg gga 

Leu Thr Gly Asp Asp Ser Trp Phe Lys Thr Val val Leu Thr Gly Gly 

385 390 395 400 



cct tac ggc gtg gac aca tea tgg cat gqg gca aag ctt att agt 
Pro Tyr Gly Val Asp Thr Ser Trp His Gly Ala Lys Leu lie Ser 
435 440 445 



1152 



1200 



age gcg tgt ttg cct gga etc tea gag agg eta gag aga gaa ctg caa 1248 
ser Ala Cys Leu Pro Gly Leu ser Glu Arg Leu Glu Arg Glu Leu Gin 
405 410 415 

gat cac ctt cct tea tct ata agt aac gga ate aga gta ata cct cct 1296 
Asp His Leu Pro Ser ser lie Ser Asn Gly lie Arg val lie Pro Pro 
420 425 430 



aat 1344 
Asn 



ttg age ate ttt cct ggt cct tgg tgt ate aca agg aag cag ttc cgt 1392 
Leu Ser lie Phe Pro Gly Pro Trp cys lie Thr Arg Lys Gin Phe Arg 
450 455 460 

cgc aag tea aga etc atg tgg tga 1416 
Arg Lys Ser Arg Leu Met Trp 
465 470 

<210> 14 

<211> 471 

<212> PRT 

<213> Arabidopsis thai i ana 



<400> 14 

Met He Leu Lys Lys val Trp Gly ser val Trp Asn Arg Ser Asn Ser 
1 5 10 15 

Gly Lys Asp Leu val Asn His Gin Arg Ala He Asp val Pro Pro Leu 
20 25 30 

Leu Leu Ser ser ser Ser Ser Leu Gly Ala Phe Asp Gin Leu Pro Met 
35 40 45 

Asp lie Leu Val Gin lie Leu Met Met Met Glu Pro Lys Asp Ala Val 
50 55 60 

Lys Leu Gly Leu Thr cys Lys Ala Trp Lys cys val Ala ser Gly Asn 
65 70 75 80 
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Arg Leu Trp lie Phe Tyr Leu Gin Cys ser Gin Glu Pro Trp Asp Ser 
85 90 95 

lie Phe Phe Ala Glu Thr ser Leu Arg Ser Gly Tyr Pro Leu Arg Met 
100 105 110 

lie Ser Ser Gin Ser Gly Glu Leu Ser Phe Met His lie Tyr ser Gin 
115 120 125 

Arg Ala Gin Val Pro Gly Ser lie lie lie Asp Gly Gly Ser Gly Tyr 
130 135 140 

cys Lys Phe Gly Trp ser Lys Tyr Ala ser Pro Ser Gly Arg ser Ala 
145 150 155 160 

Thr Phe Leu Glu Phe Gly Asn lie Glu ser Pro lie Tyr Ala Arg Leu 
165 170 175 

Gin Gin Phe Phe Ala Thr lie Phe Thr Arg Met Gin val Lys Pro Ser 
180 185 190 

Met Gin Pro lie Val val ser Leu pro Leu Cys His Phe Asp Asp Thr 
195 200 205 

Glu ser Ala Lys Ala Ser Arg Arg Gin Leu Lys Thr Ala lie Phe Asn 
210 215 220 

val Leu Phe Asp Met Asn val pro Ala val cys Ala Val Asn Gin Ala 
225 230 235 240 

Val Leu Ala Leu Tyr Ala Ala Arg Arg Thr ser Gly lie val val Asn 
245 250 255 

lie Gly Phe Gin Val lie Thr lie Leu Pro lie Leu His Gly Lys val 
260 265 270 

Met Arg Gin Val Gly Val Glu Val lie Gly Phe Gly Ala Leu Lys Leu 
275 280 285 

Thr Gly Phe Leu Lys Glu Lys Met Gin Glu Asn Asn lie Ser Phe Gin 
290 295 300 

Ser Leu Tyr Thr Val Arg Thr Leu Lys Glu Lys Leu Cys Tyr Val Ala 
305 310 315 320 

Leu Asp Tyr Lys Ala Glu Leu ser Lys Asp Thr Gin Ala ser val Glu 
325 330 335 

Val Ser Gly Glu Gly Trp Phe Thr Leu ser Lys Glu Arg Phe Gin Thr 
340 345 350 
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Gly Glu lie Leu Phe Gin Pro Arg Leu Ala Gly Met Arg Ala Met Ser 
355 360 365 

Leu His Gin Ala val Ser Leu cys Met Asp His cys Asp Ala Ala Gly 
370 375 380 

Leu Thr Gly Asp Asp Ser Trp Phe Lys Thr val val Leu Thr Gly Gly 
385 390 395 400 

Ser Ala Cys Leu Pro Gly Leu Ser Glu Arg Leu Glu Arg Glu Leu Gin 
405 410 415 

Asp His Leu Pro ser ser lie Ser Asn Gly lie Arg val lie Pro Pro 
420 425 430 

Pro Tyr Gly val Asp Thr Ser Trp His Gly Ala Lys Leu lie Ser Asn 
435 440 445 

Leu Ser lie Phe Pro Gly Pro Trp Cys lie Thr Arg Lys Gin Phe Arg 
450 455 460 



Arg Lys Ser Arg Leu Met Trp 
465 470 



<210> 15 

<211> 1092 

<212> DMA 

<213> Arabidopsis thai i ana 
<220> 

<221> CDS 

<222> (1)..(1092) 

<223> 



<400> 15 

atg gaa gca eta gtt gtt gat get ggc tct aag ttc ctg aaa gca gga 48 

Met Glu Ala Leu Val Val Asp Ala Gly ser Lys Phe Leu Lys Ala Gly 
1 5 10 15 



gca gca att cct gac cag tct cct gca atg ata att ccc tct caa atg 
Ala Ala lie Pro Asp Gin Ser Pro Ala Met He He Pro Ser Gin Met 
20 25 30 



96 



aaa cga atg gtt gat gat ggg tct tct tea get gat aac ccc acc act 144 
Lys Arg Met Val Asp Asp Gly Ser Ser Ser Ala Asp Asn Pro Thr Thr 
35 40 45 
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gtc ttt gag gat gtc act ctt gat cct att gaa agg ggt ttg att aga 192 
val Phe Glu Asp val Thr Leu Asp Pro lie Glu Arg Gly Leu lie Arg 
50 55 60 

gat tgg gat get atg gaa gat ctg ttg cgt tat gtt gtc tac act ggg 240 
Asp Trp Asp Ala Met Glu Asp Leu Leu Arg Tyr val Val Tyr Thr Gly 
65 70 75 80 



ctt gga tgg gaa gag gga aac gaa ggc aat ata ctt ttt aca gat cca 
Leu Gly Trp Glu Glu Gly Asn Glu Gly Asn lie Leu Phe Thr Asp Pro 
85 90 95 



cat ggg aag ata gat att gec cca gtt ctt gaa ggt gca gta caa cac 
His Gly Lys He Asp lie Ala Pro val Leu Glu Gly Ala Val Gin His 
145 150 155 160 



ctt cct gat gga cag gtg ata age ate ggg cga gag aga tac teg gtt 
Leu Pro Asp Gly Gin Val lie Ser lie Gly Arg Glu Arg Tyr Ser Val 
225 230 235 240 

gga gaa get ctg ttt cag cca tea ata ctg gga ctg gag gag cat gga 
Gly Glu Ala Leu Phe Gin Pro ser lie Leu Gly Leu Glu Glu His Gly 
245 250 255 



288 



ctt tgt act cct aag get att agg gag caa ttg gtg cag ttg atg ttt 336 
Leu Cys Thr Pro Lys Ala lie Arg Glu Gin Leu Val Gin Leu Met Phe 
100 105 110 

gaa aca ttc aat gtc tct gga ttt tat gca tct gag caa gca gtg ttg 384 
Glu Thr Phe Asn Val Ser Gly Phe Tyr Ala Ser Glu Gin Ala Val Leu 
115 120 125 

tec ctt tat get gtt gga cgc ate tec ggt tgc act gtt gat att ggt 432 
Ser Leu Tyr Ala Val Gly Arg lie Ser Gly Cys Thr val Asp lie Gly 
130 135 140 



480 



att gee teg aaa egg ttt gag eta ggt gga acc gag eta act aaa tta 528 
lie Ala ser Lys Arg Phe Glu Leu Gly Gly Thr Glu Leu Thr Lys Leu 
165 170 175 



ttt gec caa gag ctt gga aaa acc aac ccg teg atg aat etc age atg 576 
Phe Ala Gin Glu Leu Gly Lys Thr Asn Pro Ser Met Asn Leu ser Met 
180 185 190 

tct gat gtt gaa aaa etc aag gag cag tat gca aac tgt gec gag gac 624 
Ser Asp val Glu Lys Leu Lys Glu Gin Tyr Ala Asn cys Ala Glu Asp 
195 200 205 

gaa att get tac aaa aaa acc caa aac tgt gaa ate gag cag cat act 672 
Glu lie Ala Tyr Lys Lys Thr Gin Asn cys Glu lie Glu Gin His Thr 
210 215 220 



720 



768 



ate gtt gag cag ctt gtc egg att ate tec aca gtg tea tct gag aac 816 
lie val Glu Gin Leu Val Arg He lie Ser Thr Val ser ser Glu Asn 
260 " 265 270 

cat agg cag etc ttg gag aac act gta ctt tgt ggt ggt aca acc tec 864 
His Arg Gin Leu Leu Glu Asn Thr Val Leu cys Gly Gly Thr Thr ser 
275 280 285 

atg aca gga ttc gaa agt aga ttc cag aaa gaa gca aac ttg tgc tea 912 
Met Thr Gly Phe Glu Ser Arg Phe Gin Lys Glu Ala Asn Leu cys Ser 
290 295 300 

tct gee att agg cca aca ctg gtg aaa ccg cca gaa tat atg ccg gag 960 
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Ser Ala lie Arg Pro Thr Leu Val Lys Pro Pro Glu Tyr Met Pro Glu 
305 310 315 320 

aat ttg gqg atg tat teg get tgg gtt gga gga gec ata eta get aaa 1008 
Asn Leu Gly Met Tyr ser Ala Trp Val Gly Gly Ala lie Leu Ala Lys 
325 330 335 

gtg gtg ttt ccg cag aat cag cac gtt act aaa gca gat tat gac gag 1056 
VaT val Phe Pro Gin Asn Gin His Val Thr Lys Ala Asp Tyr Asp Glu 
340 345 350 

act gga cca tea gtg gtt cac agg aaa tgt ttc tga 1092 
Thr Gly Pro Ser Val Val His Arg Lys cys Phe 
355 36' 

<210> 16 

<211> 363 

<212> PRT 

<213> Arabidopsis thai i ana 



<400> 16 

Met Glu Ala Leu Val val Asp Ala Gly ser Lys phe Leu Lys Ala Gly 
1 5 10 15 

Ala Ala lie Pro Asp Gin Ser Pro Ala Met lie lie Pro ser Gin Met 
20 25 30 

Lys Arg Met Val Asp Asp Gly ser ser Ser Ala Asp Asn Pro Thr Thr 
35 40 45 

val Phe Glu Asp val Thr Leu Asp Pro lie Glu Arg Gly Leu lie Arg 
50 55 60 

Asp Trp Asp Ala Met Glu Asp Leu Leu Arg Tyr val Val Tyr Thr Gly 
65 70 75 80 

Leu Gly Trp Glu Glu Gly Asn Glu Gly Asn He Leu Phe Thr Asp Pro 
85 90 95 

Leu Cys Thr Pro Lys Ala lie Arg Glu Gin Leu val Gin Leu Met Phe 
100 105 110 

Glu Thr Phe Asn val Ser Gly Phe Tyr Ala ser Glu Gin Ala val Leu 
115 120 125 

ser Leu Tyr Ala val Gly Arg lie ser Gly cys Thr val Asp lie Gly 
130 135 140 



His Gly Lys lie Asp lie Ala Pro val Leu Glu Gly Ala Val Gin His 
145 150 155 160 
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lie Ala Ser Lys Arg Phe Glu Leu Gly Gly Thr Glu Leu Thr Lys Leu 
165 170 175 

Phe Ala Gin Glu Leu Gly Lys Thr Asn Pro ser Met Asn Leu Ser Met 
180 185 190 

ser Asp val Glu Lys Leu Lys Glu Gin Tyr Ala Asn cys Ala Glu Asp 
195 200 205 

Glu lie Ala Tyr Lys Lys Thr Gin Asn cys Glu lie Glu Gin His Thr 
210 215 220 

Leu Pro Asp Gly Gin Val lie ser lie Gly Arg Glu Arg Tyr ser Val 
225 230 235 240 

Gly Glu Ala Leu Phe Gin Pro Ser lie Leu Gly Leu Glu Glu His Gly 
245 250 255 

lie Val Glu Gin Leu val Arg He lie Ser Thr Val ser ser Glu Asn 
260 265 270 

His Arg Gin Leu Leu Glu Asn Thr Val Leu Cys Gly Gly Thr Thr Ser 
275 280 285 

Met Thr Gly Phe Glu Ser Arg Phe Gin Lys Glu Ala Asn Leu cys Ser 
290 295 300 

Ser Ala lie Arg pro Thr Leu Val Lys Pro Pro Glu Tyr Met Pro Glu 
305 310 315 320 

Asn Leu Gly Met Tyr Ser Ala Trp Val Gly Gly Ala lie Leu Ala Lys 
325 330 335 

Val Val Phe Pro Gin Asn Gin His Val Thr Lys Ala Asp Tyr Asp Glu 
340 345 350 

Thr Gly Pro ser val val His Arg Lys cys Phe 
355 360 

<210> 17 

<211> 1791 

<212> DNA 

<213> Arabidopsis thai i ana 



<220> 
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<221> CDS 

<222> (1)..(1791) 

<223> 



<400> 17 

atg gca aaa cca aaa tec aat tct cat etc teg tgg cag gat tac ttg 48 
Met Ala Lys Pro Lys Ser Asn ser His Leu Ser Trp Gin Asp Tyr Leu 
15 10 15 

aaa act gtt gec ccc act cag att ttg tct gaa cga gqa get aat etc 96 
Lys Thr val Ala Pro Thr Gin lie Leu Ser Glu Arg Gly Ala Asn Leu 
20 25 30 

gtt gtc ate aac tta gqg tct get aat gta aga gta gqa ttg get atg 144 
val val lie Asn Leu Gly ser Ala Asn val Arg val Gly Leu Ala Met 
35 40 45 

gat gag aaa cct ttc aat gtt cct aat tgt att gca aga tac att aca 192 
Asp Glu Lys Pro Phe Asn val Pro Asn Cys lie Ala Arg Tyr lie Thr 
50 55 60 

caa tct gqg aaa cca act gtt gtt gat cag atg ctt aac act gag gtc 240 
Gin ser Gly Lys Pro Thr val val Asp Gin Met Leu Asn Thr Glu val 
65 70 75 80 

act act aat caa cat gtt gac cgt gag aga get tat aat tct gcg gca 288 
Thr Thr Asn Gin His Val Asp Arg Glu Arg Ala Tyr Asn Ser Ala Ala 
85 90 95 

teg ctg ttg aag ata ctg ttc ctg gat gaa agt tct tct tct ggc tct 336 
ser Leu Leu Lys lie Leu Phe Leu Asp Glu Ser Ser Ser Ser Gly ser 
100 105 110 

gca tct cgc aag atg gqg agg ate gac gqa tac aat caa gcg agt acg 384 
Ala Ser Arg Lys Met Gly Arg lie Asp. Gly Tyr Asn Gin Ala Ser Thr 
115 120 125 

ata aag aaa gat tea gtc ttt act tgg act gat gtq tat gag gat gag 432 
lie Lys Lys Asp ser Val Phe Thr Trp Thr Asp Val Tyr Glu Asp Glu 
130 135 140 

aaa att tct ctt gee tea cca gca gaa act tct cct gat aaa gqt gat 480 
Lys lie ser Leu Ala Ser Pro Ala Glu Thr Ser Pro Asp Lys Gly Asp 
145 150 155 160 



get agt gca tct gaa get gtt cct gat gtt act gac tct aaa gat act 528 
Ala ser Ala ser Glu Ala val Pro Asp Val Thr Asp ser Lys Asp Thr 
165 170 175 



agt gag agt aaa cgc aag tat agg aaa atg att ttt ggt gaa gaa get 576 
ser Glu ser Lys Arg Lys Tyr Arg Lys Met l7e Phe Gly Glu Glu Ala 
180 185 190 

ttg aaa ata teg cca aaa gag cca tat tgt tta tac cat cct att egg 624 
Leu Lys lie Ser Pro Lys Glu Pro Tyr Cys Leu Tyr His Pro lie Arg 
195 200 205 

aga gqt cac ttc aat gtt teg cca cat tat tea gcg caa egg gtt tgt 672 
Arg Gly His Phe Asn Val ser Pro His Tyr ser Ala Gin Arg Val cys 
210 215 220 

gag gac tta act get ate ttg gac tgg att tta tta gag aaa ctt cat 720 
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61 u Asp Leu Thr Ala lie Leu Asp Trp lie Leu Leu Glu Lys Leu His 
225 230 235 240 

ata act cac aag gag aga ttc tct ttt cat get gtt att gtt gtc cca 768 
lie Thr His Lys Glu Arg Phe ser Phe His Ala Val He val Val Pro 
245 250 255 

gaa aca ttt gac acc cgc gaa ata aag gaa atg eta act att gtg ttg 816 
Glu Thr Phe Asp Thr Arg Glu lie Lys Glu Met Leu Thr lie val Leu 
260 265 270 

gga gag eta tac ttt aac tea gca gtt gtc cac caa gaa ggt eta teg 
Gly Glu Leu Tyr Phe Asn Ser Ala val val His Gin Glu Gly Leu Ser 
275 280 285 



gtt cat tct tat gag gaa ggc atg cca get gtg cct cac aag aca aat 
Val His Ser Tyr Glu Glu Gly Met Pro Ala val Pro His Lys Thr Asn 
385 390 395 400 



tac gag aat atg ttg gaa gac act tgg aac atg gac ttt gga ggt ggt 
Tyr Glu Asn Met Leu Glu Asp Thr Trp Asn Met Asp Phe Gly Gly Gly 
435 440 445 



864 



gee gtt ttt ggg aat ggt ttg aca aca get tgt att gtg aat ata gga 912 

Ala Val Phe Gly Asn Gly Leu Thr Thr Ala cys lie Val Asn lie Gly 
290 295 300 

gee cag aca agt aca gta gtt tgt gtc gag gat ggg gtc tea ttg cca 960 

Ala Gin Thr Ser Thr Val Val Cys val Glu Asp Gly Val ser Leu Pro 
305 310 315 320 

aat act gaa aag att tta cct ttt gga gga gag gat ata tgt aga tgc 1008 

Asn Thr Glu Lys lie Leu Pro Phe Gly Gly Glu Asp lie cys Arg Cys 
325 330 335 

ctt eta tgg att cag agg cat tac caa aag tgg cca caa ate aac aca 1056 

Leu Leu Trp lie Gin Arg His Tyr Gin Lys Trp Pro Gin lie Asn Thr 

340 345 350 

gat gtt ttg gca aag cca ate gat atg ctg atg ctt aat caa ctt aag 1104 

Asp val Leu Ala Lys Pro lie Asp Met Leu Met Leu Asn Gin Leu Lys 
355 360 365 

gag tea ttt tgt gaa att aga gca gga gaa ctt gaa act gtt gca acg 1152 

Glu Ser Phe Cys Glu lie Arg Ala Gly Glu Leu Glu Thr Val Ala Thr 
370 375 380 



1200 



etc acc tec ctt aac gtt cca cca atg ggt ctg ttt tat cct aac ctt 1248 

Leu Thr Ser Leu Asn Val Pro Pro Met Gly Leu Phe Tyr Pro Asn Leu 
405 410 415 

ttg gtc cct gaa ata ttt ccc cag cca cca cgt caa tgg ttt caa gac 1296 

Leu val Pro Glu lie Phe Pro Gin Pro Pro Arg Gin Trp Phe Gin Asp 

420 425 430 



1344 



ggt aac atg gga tta cca atg tgg gat agt ttt gca ttt teg cct tea 1392 
Gly Asn Met Gly Leu Pro Met Trp Asp ser Phe Ala Phe ser Pro ser 
450 455 460 

aaa cca aag aaa gaa gag aag att ggt etc get gaa gee att aca age 1440 
Lys Pro Lys Lys Glu Glu Lys He Gly Leu Ala Glu Ala lie Thr Ser 
465 470 475 480 

age att etc tct get gga cgc ata gac ctt aga egg aag ctt ttc tec 1488 
Ser lie Leu Ser Ala Gly Arg He Asp Leu Arg Arg Lys Leu Phe Ser 
485 490 495 
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1536 



eta aag aag tac aaa gac tct tat cac ctt caa gqt caa gca atg tac 
Leu Lys Lys Tyr Lys Asp ser Tyr His Leu Gin Gly Gin Ala Met Tyr 
580 585 590 





595 


<210> 


18 


<211> 


596 


<212> 


PRT 


<213> 


Arabidopsis thai i ana 


<400> 


18 



Met Ala Lys Pro Lys Ser Asn Ser His Leu Ser Trp Gin Asp Tyr Leu 
15 10 15 

Lys Thr val Ala Pro Thr Gin lie Leu ser Glu Arg Gly Ala Asn Leu 
20 25 30 

Val Val lie Asn Leu Gly Ser Ala Asn val Arg Val Gly Leu Ala Met 

35 40 45 

Asp Glu Lys Pro Phe Asn val Pro Asn Cys lie Ala Arg Tyr lie Thr 
50 55 60 

Gin Ser Gly Lys Pro Thr val val Asp Gin Met Leu Asn Thr Glu val 
65 70 75 80 

Thr Thr Asn Gin His Val Asp Arg Glu Arg Ala Tyr Asn ser Ala Ala 
85 90 95 



1584 



age att caa ttg att gqt gqt get gqt ttg acg aaa gqt ctt gta gcg 

Ser lie Gin Leu lie Gly Gly Ala Gly Leu Thr Lys Gly Leu val Ala 

500 505 510 

gca gtq gaa gaa aga gtt ctt cac gcg ata cct cca act gaa gec att 

Ala Val Glu Glu Arg Val Leu His Ala He Pro Pro Thr Glu Ala lie 

515 520 525 

gat aca gtq cag gtt ctg cca tea aga acg gag cca caa ttc gta act 1632 

Asp Thr val Gin val Leu Pro Ser Arg Thr Glu Pro Gin Phe val Thr 

530 535 540 

tgg aaa gqa gqa gcg ata ttg gga att ctg gac ttt gqg agg gag get 1680 

Trp Lys Gly Gly Ala He Leu Gly lie Leu Asp Phe Gly Arg Glu Ala 
545 550 555 560 



tgg att gag aga cat caa tgg atg gta aat gqg gtt aat aaa gqt got 1728 
Trp lie Glu Arg His Gin Trp Met val Asn Gly val Asn Lys Gly Gly 
565 570 575 



1776 



ttc ate aac ccc tag 1791 
Phe lie Asn Pro 
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Ser Leu Leu Lys lie Leu Phe Leu Asp Glu ser ser Ser Ser Gly Ser 
100 105 110 

Ala ser Arg Lys Met Gly Arg lie Asp Gly Tyr Asn Gin Ala ser Thr 
115 120 125 

lie Lys Lys Asp Ser Val Phe Thr Trp Thr Asp val Tyr Glu Asp Glu 
130 135 140 

Lys lie ser Leu Ala ser Pro Ala Glu Thr ser Pro Asp Lys Gly Asp 
145 150 155 160 

Ala ser Ala ser Glu Ala val Pro Asp val Thr Asp ser Lys Asp Thr 
165 170 175 

Ser Glu ser Lys Arg Lys Tyr Arg Lys Met lie Phe Gly Glu Glu Ala 
180 185 190 

Leu Lys lie Ser Pro Lys Glu Pro Tyr Cys Leu Tyr His Pro lie Arg 
195 200 205 

Arg Gly His Phe Asn Val Ser Pro His Tyr Ser Ala Gin Arg Val cys 
210 215 220 

Glu Asp Leu Thr Ala lie Leu Asp Trp lie Leu Leu Glu Lys Leu His 
225 230 235 240 

lie Thr His Lys Glu Arg Phe ser Phe His Ala Val lie val Val Pro 
245 250 255 

Glu Thr Phe Asp Thr Arg Glu lie Lys Glu Met Leu Thr lie val Leu 
260 265 270 

Gly Glu Leu Tyr Phe Asn Ser Ala val Val His Gin Glu Gly Leu Ser 
275 280 285 

Ala val Phe Gly Asn Gly Leu Thr Thr Ala Cys lie Val Asn lie Gly 
290 295 300 

Ala Gin Thr Ser Thr Val val Cys val Glu Asp Gly val Ser Leu Pro 
305 310 315 320 

Asn Thr Glu Lys lie Leu Pro Phe Gly Gly Glu Asp lie Cys Arg cys 
325 330 335 

Leu Leu Trp He Gin Arg His Tyr Gin Lys Trp Pro Gin lie Asn Thr 
340 345 350 

Asp val Leu Ala Lys Pro lie Asp Met Leu Met Leu Asn Gin Leu Lys 
355 360 365 
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Glu ser Phe Cys Glu lie Arg Ala Gly Glu Leu Glu Thr val Ala Thr 
370 375 380 

val His Ser Tyr Glu Glu Gly Met Pro Ala val Pro His Lys Thr Asn 
385 390 395 400 

Leu Thr Ser Leu Asn val Pro Pro Met Gly Leu Phe Tyr Pro Asn Leu 
405 410 415 

Leu val Pro Glu lie Phe Pro Gin Pro Pro Arg Gin Trp Phe Gin Asp 
420 425 430 

Tyr Glu Asn Met Leu Glu Asp Thr Trp Asn Met Asp Phe Gly Gly Gly 
435 440 445 

Gly Asn Met Gly Leu Pro Met Trp Asp ser Phe Ala Phe ser Pro ser 
450 455 460 

Lys Pro Lys Lys Glu Glu Lys He Gly Leu Ala Glu Ala He Thr ser 
465 470 475 480 

ser lie Leu ser Ala Gly Arg lie Asp Leu Arg Arg Lys Leu Phe ser 
485 490 495 

Ser lie Gin Leu He Gly Gly Ala Gly Leu Thr Lys Gly Leu val Ala 
500 505 510 

Ala Val Glu Glu Arg Val Leu His Ala lie Pro Pro Thr Glu Ala lie 
515 ~ 520 525 

Asp Thr val Gin val Leu Pro Ser Arg Thr Glu Pro Gin Phe Val Thr 
530 535 540 

Trp Lys Gly Gly Ala lie Leu Gly lie Leu Asp Phe Gly Arg Glu Ala 
545 550 555 560 

Trp He Glu Arg His Gin Trp Met val Asn Gly val Asn Lys Gly Gly 
565 570 575 

Leu Lys Lys Tyr Lys Asp ser Tyr His Leu Gin Gly Gin Ala Met Tyr 
580 585 590 

Phe lie Asn Pro 
595 

<210> 19 
<211> 29 
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<212> DNA 

<213> Arabidopsis sp. 

<400> 19 

ctagaagctt ttaaggatta agactctcc 29 

<210> 20 

<211> 22 

<212> DNA 

<213> Arabidopsis sp. 

<400> 20 

ctcgtatgta tcccccttct cc 22 

<210> 21 

<211> 21 

<212> DNA 

<213> Arabidopsis thai i ana 

<400> 21 

tgatggatct atcaccatca g 21 

<210> 22 

<211> 21 

<212> DNA 

<213> Arabidopsis thai i ana 

<400> 22 

ggtgggattc caatcacttt c 21 

<210> 23 

<211> 21 

<212> DNA 

<213> Arabidopsis thai i ana 



<400> 23 

tgatggatct atcaccatca g 
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<210> 24 

<211> 21 

<212> DNA 

<213> Arabidopsis thai i ana 

<400> 24 

ggtgggattc caatcacttt c 21 

<210> 25 

<211> 19 

<212> DNA 

<213> Arabidopsis thai i ana 

<400> 25 

tttgatgggc caaatgatg 19 

<210> 26 

<211> 36 

<212> DNA 

<213> Arabidopsis thai i ana 

<400> 26 

cttccaacct aggtgagatg tttcaacaaa atgtgc 36 

<210> 27 

<211> 19 

<212> DNA 

<213> Arabidopsis thai i ana 

<400> 27 

tcaacagcag gacacaagg 19 

<210> 28 
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<212> DNA 

<213> Arabidopsis thai i ana 
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<400> 28 

cccaatgcct aggaaatccg agttcaacat cctaatc 37 

<210> 29 
<211> 23 
<212> DNA 

<213> Arabidopsis thai i ana 
<400> 29 

acaaaccaga tatcagcaca tgg 23 

<210> 30 

<211> 23 

<212> DNA 

<213> Arabidopsis thai i ana 



<400> 30 
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